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Summary. The authors study the problem of testing whether the distribution 
function (d.f.) of the observed independent chance variables x;,--- , zx, is a 
member of a given class. A classical problem is concerned with the case where 
this class is the class of all normal d.f.’s. For any two d.f.’s F(y) and G(y), let 
4(F, G) = sup,|F(y) — G(y)|. Let N(y | u, o”) be the normal d.f. with mean yu 
and variance o°. Let G2(y) be the empiric d.f. of 2, --+,2,. The authors 
consider, inter alia, tests of normality based on v, = 6(G2(y), N(y|Z,s°)) and 
on w, = f (Gi(y) — N(y|#, s’))’ d,N(y | Z, s°). It is shown that the asymptotic 
power of these tests is considerably greater than that of the optimum x’ test. 
The covariance function of a certain Gaussian process Z(t), 0 < ¢ S 1, is found. 
It is shown that the sample functions of Z(t) are continuous with probability 
one, and that 


1 
lim P{nw, < a} = P{W <a}, where W = | [Zo] dt. 
n—-eo 0 

Tables of the distribution of W and of the limiting distribution of +~/n v, are 
given. The role of various metrics is discussed. 


1. Introduction. Let x, , --- , x, be nm independent chance variables with the 
same cumulative distribution function G(x) (i.e., G(x) = Pia, < x}) which is 
unknown to the statistician. It is desired to test the hypothesis that G(z) is 
a normal distribution. This is an old problem of considerable interest which has 
received a fair share of attention in the literature. 

A commonly used test consists essentially in testing whether the third mo- 
ment of G(x) about its mean is zero and whether the ratio of the fourth moment 
about the mean to the square of the second moment about the mean is three. 
It is obvious that this is not really a test of normality, because there are many 
non-normal distributions which satisfy these conditions on the moments. 

Perhaps the best of the commonly used large sample tests of normality is the 
x’ test due to Karl Pearson; see for example Cramér [1], Sections 30.1 and 30.3, 
and the recent results of Chernoff and Lehmann [2]. The asymptotic power of 
the x* test was studied by Mann and Wald [3]. (It is true that these authors 
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studied the problem of goodness of fit for a simple hypothesis, which in our 
problem would correspond to knowing the mean and variance of G(x). However, 
it is plausible that the comparison in power which we will make below will be 
true a fortiori for our problem.) One of their principal results is the following. 
Define the distance 6(H,, H.) between any two distribution functions H; and 
Hy, by 


6(H,, He) = sup.|Hi(x) — H2(x)|. 


Suppose one tests the null hypothesis that G(x) = R(x), where R(x) is a given 
continuous distribution, at some fixed level of significance. In [3] it is shown 
that the x’ test based on n observations and k, intervals of equal probability 
under R, where k,, is chosen for each n so as to minimize the value A, for which 
the minimum of the power function among alternatives R* with 6(R*, R) = 
A, is 3, gives A, = cn ”° when n is large. In Section 5 we shall show that the 
result of Mann and Wald just stated also holds if 6 is replaced by the measure 
of discrepancy y introduced in Section 5. 


Let 
y(a) = . 


le 
G* (2) _— n 2 ¥2(xi) 


be the empiric d.f. of z,, --- , 2, , that is, G(x) is the proportion of z,’s less 
than x. The asymptotic distribution of 5(G, Gt) was found by Kolmogoroff 
[4] and it is known that 6(G, G*) is of the order n~”” in probability (that is, 
with probability arbitrarily close to one). If now one bases the test of the hy- 
pothesis that G(z) = R(x) on 6(R, G%), with large values of 6 significant, it 
follows that, for large n, it is sufficient that 5(R, R*) be of the order n~“” for 
the probability of rejecting the null hypothesis to be appreciable (2 4). To put 
matters a little differently: Let 6(R, R*) = h be small (so that n has to be large 
in order to distinguish between R and R*). Then, if n has to be equal to N in 
order to guarantee that the power at R* of the x’ test of goodness of fit be at 
least 4, when one uses the test based on 6 it is enough that n be of the order 
of N“°. This is a considerable improvement (for large n). The result just stated 
holds also for the classical “‘w”” test if 6 is replaced in the above by 7; this fol- 
lows from the results of Section 5. 

Let us return to the problem of testing the composite null hypothesis whether 
G(x) is normal (its mean and variance being unknown). One of the present 
authors has been developing the minimum distance method for estimation and 
testing hypotheses in a number of papers (Wolfowitz [5], [6], [7], [8]). In accord 
with this method it is proposed in [5] (page 149) that this test of normality be 
based on 6(G% , N**) with the large values critical. Here N** is the class of 
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all normal distributions, and the distance of any d.f. H(x) from the class N** 
is given by 
6(H, N**) = inf 6(H,N). 
Nen** 

In Sections 2, 3, and 4 we investigate tests based on various ‘‘distance”’ criteria, 
constructed in the spirit of the above discussion, and in Section 5 we discuss 
the asymptotic power of these tests. As stated in [5], the minimum distance 
method is not limited merely to testing normality, and in Section 4 we discuss 
its application to other tests of goodness of fit. 

Throughout this paper we discuss tests, computations, and power considera- 
tions in terms of particular (normal or rectangular) examples, but it will be 
obvious that the results of Sections 2, 3, and 5 may in general be carried over 
to testing composite hypotheses involving parametric families. The minimum 
distance method applies in principle to even more complicated families of dis- 
tribution functions about which one desires to test a hypothesis. For hypotheses 
of a more complicated nature than our examples, there will often exist the 
additional complication that the test criterion may not be distribution-free 
under the null hypothesis. 

Tests may also be constructed using other “distances” than those mentioned 
in Section 2 and above (see also [5], pp. 148-149, and [6], p. 10 in this connec- 
tion), involving other “estimators” than those of Sections 2 and 3, and involving 
other modifications of the notion of “‘distance methods”’ as motivated in this 
section. 


2. Testing normality. For convenience we divide this section into several 
subsections. 

2.1. The computation of 6(G* , N**) offers considerable difficulty; unpublished 
work on this subject has been done by Blackman. The distribution of 5(G% , N**) 
under the null hypothesis is still unknown. (It is easy to verify that, when G(x) 
is a member of N**, the distribution of 6(G% , N**) does not depend on G(z). 
Thus the composite null hypothesis determines uniquely the distribution of the 
test criterion.) In Section 4 we shall give an example of another problem of 
testing hypotheses where the limiting distribution of the minimum distance 
criterion is explicitly calculated. In the present Section 2 we shall consider 
some other “distance” tests. (For the case where either the mean or variance 
is assumed known, the test corresponding to that discussed in this section is 
being studied by Darling [18]; the suggestion of using such tests is apparently 
due to Cramér.) Let 


1 n 
#=-) x, 
1 


n 


Let N(x |Z, s’) be the normal distribution function with mean # and variance 
s’. One can base the test of normality on v, = 6(G%(r), N(x | #, s°)), with the 
large values critical. It is easy to see that, when G(x) is actually a member of 
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N**, the distribution of v, does not depend on G(x). The distribution of v, does 
not seem easy to obtain, except, for example, by Monte Carlo methods. Another 
test criterion, similar to the above but of the “w”” type, is to base the test on 


1 ,1 
UW, =- | Sen 
n Jo 


where Z,.,, is defined in 2.2.° (The idea of the “w’” test, which is defined pre- 
cisely in Section 6, is due to von Mises, with a modification by Smirnov.) Sec- 
tions 2.2 to 2.6 are concerned with the limiting distribution of nw, as n — «© 
when G is normal. The power of the tests based on v, and w, is discussed in 
Section 5. In Section 3 we treat briefly an example which illustrates the con- 
struction of test criteria which are similar to v, and w, but which may use 
“inefficient” statistics to estimate which member of N** G is (if it is); such 
techniques may have obvious practical importance. 

2.2. Let 2; , 2, --- be independent, identically distributed Gaussian random 
variables with zero mean and unit variance. Let G*(x) be as defined in Section 
1. We shall use the following notation: 


¢(x) = o(—2z) = =o" *, I@)= / o(y) dy, 


J(y) =I"(1 — y) = {x|y = 1 — I(x)}, g(x) = o(x)/I (x). 


We also let [z} = smallest integer 2 z. For 0 S r S 1, we put U,,, = [rn|th 


from the bottom among the ordered 2;, --- , x, . Finally, let* 


Zin = VnlGi (VS. (r) + En) —1] 


where 


I, = >. £3; 
NL ixl 

(Remark: S, and S', may be used equally well in the definition of Z,,, for the 
purpose of obtaining the limiting distribution; in applications S’, should prob- 
ably be used.) 

2.3. We shall show that for 0 S m1 < rm <--- < ~™ S 1, the quantities 
Z,,.., Where 1 S i S k, are asymptotically jointly normally distributed with 
zero means and covariance function (for Z,,, and Z,, asn — ©) 


(2.1) K(s, t) = min(s, t) — st — (Qn) "(1 + J(s)J(t)/2)e Or, 


We shall show here that (2.1) follows from the fact proved in Section 2.4, 
that +/nzn, Vn(S, — 1), and Wn(U,,.. — J(rs)), where 1 S 7 S k, are jointly 
3 This definition is equivalent to that given in the summary at the beginning of the paper. 
4 Elsewhere in this paper, in the summary, for example, the conventional symbol s? is 
sometimes used instead of the typographically easier (here) symbol S,. Both represent 
the same thing. Also Z, and # are used interchangeably in a manner to cause no confusion. 





GOODNESS OF FIT 193 


asymptotically normal with means 0 and covariance matrix (presented in the 
same order) given by 


1 1 ove 1 | 
T(r) T(re) «++ ST (re) | 
a eae te | 
S@uy- en |) Aedes ae J 
r(1 — rj) 
oJ (ri) oJ (r;)) ’ 


In fact, all these results are well known ([{1], p. 364, 369) except for the joint 
normality and the last k entries in the first two rows (and columns). 

Assume then for the moment that the above is proved. The event {Z,,, < z} 
may be written as {{[number of 2; which are < +/S/,J(r) + &] S mr + V/nz}. 
This in turn is the same as {U,4.,./x, »n 2 VShJ(r) + €,}, or 


{Vné, v Vind (r)(VSi “ae 1) = Vn U rasp Jain as J(r + z/V/n)) 
< VnlJ(r + 2/Vn) — J(r)j}. 


As n - ~, neglecting terms of higher order in probability, we may replace 
VnlJ(r + 2//n) — J(r)] by 2J"(r) and Wn(/S%, — 1) by 4+/n(S, — 1). 
We conclude that 


Ay = Aji = T; rs. 


lim P{Z,,.0 Ss 41, re » ane zx} 


n—-00 


(23) = lim P ty5 [/né, + 4/nJ(r))(S, — 1) 


— SVU r4251/aon = I(r; + z:/V/n))] S %, 


Thus, Z,,.., Zrs.n,°*** » Zry.n are jointly asymptotically normal, with the same 
limiting distribution as the quantities 


[V/nEn + 30/nJ(ri)(Sa — 1) — Vn(U,,n — J(rd))/I"(ri), i = 1, +++, ke 


From this and (2.2), the result (2.1) follows by direct computation and the 
fact that J’(r) = 1/¢(J(r)). 

2.4. It remains to prove the joint asymptotic normality of the quantities 
mentioned in the paragraph following (2.1), and to verify the last k entries of 
the first two rows (and columns) of (2.2). We shall in fact compute here only 
the limiting distribution of ~/nz, , ~/n(S, — 1), and ~/n(U,,, — J(r)) where 
0 <r < 1; from this will follow the desired result of (2.2), and the method of 
proof of joint asymptotic normality for the k + 2 random variables previously 
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mentioned will be evident from that for the case k = 1 considered here (note 
especially p. 369 of [1] in this connection). 

We begin by introducing a process which will enable us to simplify this com- 
putation. For fixed n, let Y;,n) be a chance variable whose distribution function 
is that of U,,, above. Let Y; , where 1 S i S nandi # [rn], be random variables 
whose joint conditional distribution, given that Yin) = y, is such that these 
n — 1 random variables are (conditionally) independent with Y; having a 
(conditional) truncated normal distribution given by 


|max(0, 1 — I(z)/I(y)], [rn] <i Sn. 


PLY; <2|¥ — {min{l, (1 — 1@))/(1 — T(y))], 1 St < [rn 
i ol Fey FT 


If now Y,,---, Y, are reordered with probability 1/n! for each possible re- 
ordering, and the resulting reordered variables are labeled Xj ,--- , X), it is 
easily verified that X{,---,X\ have the same joint distribution as the 
%1,°** , 2, considered in Section 1. We shall use the process Y;,--- , Y, to 
compute the conditional distribution of S, and Z,, given that U,,, = y = 
J(r) + w/V/n, say. Let 


{rn]}—1 n 


(el -1)2, = > ¥:;; “&— beet = > Yi, 


1 (rn}+1 


(Inr] -1)S. = D> Yi, (n—[nr)Si = D Yi. 
1 [rn)+l 
It is easy to compute that for the truncated normal distribution from y to 
mentioned above, the first four moments about the origin are 
m(y) = gly), my) = 1+ yg), 

us(y) = (y° + 2) g(y), u(y) = 3 + (y* + 3y) gy). 
The corresponding moments for the truncated distribution from — = to y are 
clearly —u:(—y), ue(—y), —us(—y), and ws4(—y). From these and [1], p. 364, 
we obtain that 


A, = V {nr] - | cr + g(—y)), B, = V [nr] “7 1(S, —-l+ y g(—y)) 


are asymptotically conditionally normal with means 0 and covariance matrix 


( — yg(—y) — [g(-y)P —(y’ + 1) g(—y)— ylg(—y)F 
—(y¥ + 1) g(-y)— ylg(—y PF 2 — Cy’ +9) o(-y) — ylg(—pP/? 


and that C, = +~/n — [nr] (X%. — g(y)) and D, = V/n — [nr] (S, — 1 — ygly)) 
are asymptotically conditionally normal with means 0 and covariance matrix 


( + y gy) — gl (y* + 1) gly) — lg 
(y+ 1) gy)— vo@P = 2+ +) 9) - vol) 
with (A, , B,) conditionally independent of (C, , D,). By using the Liapounoff 
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condition one proves that the approach to normality is uniform for any finite 
w-interval. Let O,( ) denote “order of ( ) in probability” (e.g., [9]). Noting 
that ~/[nr] — Wnr = O(1/-V/n) and that V/n(y/n) = O,(1/+/n), we obtain 
as an expression for +~/nz, , given that U,,, = y, 

Vnin = VrAnt VI — rn + Vnl—-rg(—y) + (1 — r) gy) + 0,(01/-V/n). 
But 


—rg(—J(r) — w/Vn) + 1 — 1) g Jr) + w/V/n) 


_ wv wer 1 
Ti 1A bet +0,( ), 


n 


so that 


fad: JE. + VIF, + ORs o,(+.). 
r(l — r) a/n 


In similar fashion, we obtain 


VilS, — 1) = VABe + VI=*Dy + ws() PLOT 5 0,( 1). 

rl — r) — 1r) /n 
In both cases the O0,(1/+/n) term is uniform in every bounded interval of w. 
Since W, = W/n(U,., — J(r)) is asymptotically normal with mean 0 and var- 
iance r(1 — r)/{¢(J(r))]’, the desired asymptotic joint normality follows easily 
from the last two displayed expressions. The covariance of ~/nz, and W,, is 
most easily computed as E{W,-E{+/né, | Wa}}, that of ~/n(S, — 1) and 
W,, being computed similarly; these give the desired results for the last k entries 
of the first two rows (and columns) of (2.2). 

2.5. We now show that there exists a representation of any Gaussian process 
with mean identically zero and a covariance function like that of (2.1), whose 
sample functions are continuous with probability one (w.p. 1). 

Let W(t), 0 S ¢ S 1, be the Kac-Siegert representation ({12]) of a Gaussian 
process with continuous covariance function K'(s, t). Let {A,} be the eigenvalues 
and {¢,(t)} the corresponding normalized eigenfunctions of K’(s, t). Suppose 
furthermore that g(t) in L’ is such that 


K’"(s, t) — g(s)g® = K"(s, t) 


is a covariance function. We shall show how to construct explicitly a process 
Z(t) with covariance function K”(s, ¢) in terms of the process W(t). (All processes 
studied in this section are to have mean zero.) 

We first prove two lemmas. 

Lemma 1. A necessary and sufficient condition that K”(s, t) be positive definite is 
that 


1 


B= De « apiiir (gy =| g(t) gx(t) dt. 
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(If a X, is 0, it follows immediately from the positivity of K”(s, ¢) that g(t) is 
orthogonal to all the eigenfunctions belonging to \,; thus the corresponding g, 
vanish and we interpret g,”/d, as zero.) 

Only the necessity of the condition is used in the application of the lemma. 
The proof of sufficiency is included because it is so brief and sufficiency seems to 
be of interest. 


1. Necessity. Set ¥(t) = >ofv gx(t) and note that by definition 
1 1 
[ [ G0 - 0 gO) Ve) vO asat = 0. 
Evaluating the double integral we obtain 


X vide = (= Uk ) 


and setting vu. = gx/dAx 


gi “~ gi\ 
bee2(L8). 


Thus >: gi/Xx <= 1 for every n and the theorem follows. 
2. Sufficiency. If )°fgi/\. < 1 we have 


(San) = (Eavn sy «(bal(Es) sen. 


and positiveness of K’(s, t) — g(s) g(t) follows. 

Lemma 2. The series >-? gugx(t) cae uniformly (to g(t)). 

We have | Dingwr(t)| S VW" g/ry Vy dr (t). By Mercer’s theorem, 
p by wilt) = K(t, t). Hence re gex(t)| S V Kit, 0) vy gi/N . Since 
= g/d converges (by Lemma 1), Lemma 2 follows. 

We are now ready to prove that 


24) 2 = ww -S-NB —#) 9) OH | WO enlo at 


has the covariance function K”(s, ¢). 
Note first that the chance variables 


{= | W (t)en(t) at} 


are independently and normally distributed, with mean 0 and variance 1. Since 
> g/d < ©, the series 


. 5 7 | W(t)yn(t) dt 


converges in the mean (and even w.p. 1) and thus defines a random variable. 
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To calculate the covariance of Z(t) we need 
2 1 } 
E(w 2 | W(Her(l) ath. 
kml Ax Jo ) 


But E{ W(t) fo W(Ogx(t) dt} = fi E{W(t)W(t)}er(t) dt = fi K’(t, te(t) dt = 
divx (t), and hence 


2 1 2 
ELW) & [ Weld ath =F nent = o(0. 
The covariance function of Z(f) now comes out to be 
K'(s, t) — 267°(1 — V1 — B) g(s)g(t) + 6°71 — V1 — BF)’ gs) 
= K'(s, t) — g(s)g(t) = K’(s, d). 


Since g(t) is continuous (by Lemma 2) it follows, in particular, that the sample 
functions of Z(t) are continuous w.p. 1 if the sample functions of W(t) are con- 
tinuous w.p. 1. Now let K’(s, t) = min (s, ¢) —st. Then the sample functions of 
W(t) are continuous w.p. 1. The application of the above result twice (once for 
each remaining negative product in (2.1)) then proves that the representation 
Z(t), which is Gaussian with mean zero and covariance function (2.1), has con- 
tinuous sample functions, w.p. 1. 

2.6. From the results of Sections 2.2 to 2.5, it is easily verified that the demon- 
stration given by Kac on pp. 197-198 of [10] for the case K(s, t) = min(s, t) — st 
carries over with only slight modifications to the case now under discussion where 
K(s, t) is given by (2.1). We conclude that 


(2.5) lim P{nw, < a} = P{W < a}, 

where W = fo [Z(t)}’ dt and Z(t) is Gaussian with covariance function given by 
2.1) and sample functions continuous w.p. 1. Modifying slightly the technique 
of Kac and Siegert [11], [12] as applied on pp. 199-200 of [10], we now study the 
distribution of Z. We write 


(2.6) K(s, t) = K*(s, t) — hi(s) Ai(t) — he(s) helt) 62 2t Il, 


where K*(s, 2) = min(s, t) — st and hy(s) = (24)! "[J(s)/~/2P te ** for k 
= 1,2. Let Ai, \2, --- be the eigenvalues (all positive, since K is positive 
definite) of the integral equation 


(2.7) | K@w) &@ ay = ro). 


Following the demonstration of [10], we conclude that the characteristic func- 
tion of W is 


(2.8) Ee™® = JJ (1 — 2ita,)~”. 
j=l 
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Thus, we may express W as 
(2.9) W=DR;, 


where the R; are independent and R; has density function 


1 —r/2hj 
VJ Dark, e€ r. > @ 
(We remark that it may of course be easier to obtain the convolution with itself 
of the distribution of W rather than the latter itself. This could be used if one 
based a test of normality on the sum of two W,,’s, each computed from half the 
sample of size n = 2m.) 

We now give a procedure for finding the A; . For any eigenvalue \ and corre- 
sponding eigenfunction ¢ (not necessarily normalized) of (2.7), write 


(2.10) 


1 
(2.11) C. = [| hw) oy dy, i= 1,2 
~0 
We can rewrite (2.7) as 
1 
(2.12) | K*(a, y) e(y) dy — Cilu(a) — Crho(x) = A g(z). 
Differentiating twice with respect to x and writing » = 1/d (we may consider 
u > O in the sequel), we obtain 
(2.13) eo" (x) + welx) = —Cy®hi (x) — Cou’hz (2). 
Any eigenvalue \ and eigenfunction ¢(x) of (2.7) satisfy (2.13), (2.11), and 
(2.14) ¢(0) = g(1) = 0. 


Conversely if \ and ¢(x) satisfy these conditions they are an eigenvalue and 
eigenfunction of (2.7). For let 


1 
| KG@,y) oy) dy = dO). 
0 
As we obtained (2.13) we get 
6” (xz) + wo(x) = —Cw hy (x) — Cau“hs (2). 
Hence 6”(z) = $”(x), and since also 6(0) = 6(1) = O we have @(z) = ¢(z). 
Our problem now is to find a value uw for which chere exist a function ¢(z) 
and constants C; and C, satisfying (2.13), (2.14), and (2.11). For given C,; and 
C, the general solution of (2.13) can be written 
(2.15) ¢(z) = Asin wz + Bos ur — wCigi(x) — wCrg2(z), 


where 


(2.16) g(x) = / “He sin u(t — 2) db. 
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Applying conditions (2.14) and (2.11) gives 
0 = B — uC gi(0) — nCg(0), O=Asinu + Beoosy — pCgi(l) — uC.g2l1), 
1 1 
C,;=A [ hx) sin px dx + B | h(x) cos ux dx 
“0 0 
(2.17) 


el 


—nC; | hi(x) gi(x) dx — uC | hi(x) go(x) dx. 


These equations have a nontrivial solution for A, B, C,, and C, if and only if 
the determinant D(u) of the coefficient of these four quantities is zero. Hence 
the eigenvalues of (2.7) are determined by the roots of D(u) = 0. 


The following method of computing D(z) is due to R. J. Walker, to whom the 
authors are greatly obliged. 


We first note some pertinent properties of h,(x) and g;(z). 
h(l— 2) = hi(z), (0) = 0, (4) = 1/V/2e, (4) = 0; 
hol — x) = —he(x),  he(0) = 0, —na(4) = O, ho(}) = 1/2; 


1/2 1/2 
ga(l— 2x) = hi (é) sin w(t — 1 + 2) dt = hi (1 — s) sin u(s — x) ds 
l—z z 
= gi(z); 
and similarly 
gl — xz) = —g,(z). 
It follows that gi(1) = g:(0) and g.(1) = —g2(0), and 


fo [ nieroce 


| 


\9, 


Also, using sin ux = sin 3u cos u(} — x) — cos du sin u(} — 2), we get 


[ hi(x) gj(z) dx = 


1 1/2 
I its) cin pe de @ din I hilz) cos w(§ — 2) de, 
0 0 
el 


1/2 
| he(x) sin ux dx —2 cos du | he(x) sin p(4 — x) dz, 
0 0 


with similar reductions for the coefficients of B in (2.17). 


Introducing these simplifications we get by direct computation D(u) = 
—2D,(u) De(u), where 


1/2 
D,(u) = cos du E + 2y | hy(x) gi(a) az | 


(2.18) 


1/2 
—2yg: (0) | hi(x) cos u(} — x) dz, 





M. KAC, J. KIEFER, AND J. WOLFOWITZ 


1/2 


D(u) = sin 4p E + Qu hex) go(zx) az | 
(2.19) " 


1/2 
—2ug2(0) ; he(x) sin u($ — x) dz. 


These equations can be put in a form more suitable for computation. Integrat- 
ing (2.16) by parts gives, for i = 1, 
u , pu 
g(x) = — Van 78 u(t — x) + uh(z) — we hi(t) sin w(t — x) dt 


1/2 
g(0) = — Viz cos ku — : hi (é) sin ut dt. 


Putting these in (2.18) gives 


1/2 
D,(u) = cos du E + 2p" hi(x) as | 
0 


+ 2,8 | If a(x) he(O c00 pl — 2) sin pt dA 
8+T 


- I hi(x) hi(t) sin w(t — x) cos dy aa, 
T 


where S is the triangle bounded by the lines ¢ 
the triangle bounded by x = 0, 


= 0, ¢=2,andz = },andT 
x = t,and? = 3. 
T reduces to 


The sum of the integrals over 


[f @ md 08 u(} — #) sin ux dA, 
T 
which equals the integral over S. Hence, finally 


el/2 


D,(u) = cos 3u 1 + 2°  Bi(2) as | 
“0 
(2.20) 


~1/2 1/2 

+ 4,° | hy(x) sin ux dx | h(t) cos u(4 — #) dt. 
/0 z 

Similarly, (2.19) becomes 


a1/2 
D2(u) = sin 3y E + 2° | h3(z) as | 


1/2 1/2 
he(x) sin ux ax | h(t) sin w(4 — 6) dt. 


The method just developed for obtaining the eigenvalues of (2.7) seems more 
accurate and computationally simpler than other methods, such as those em- 
ploying trial functions. In Section 6 the smallest few zeros of the functions 
D,(u) and D2(u) of (2.20) and (2.21) are tabulated, and an approximation for the 


+ 4° 
0 
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distribution of W is thereby obtained. This approximation is compared with 
empirical distribution functions of nw, obtained by sampling. Empirical distri- 
bution functions of nw, and +/nv, are tabulated, and certain other interesting 
results of the sampling experiments are noted (for example, the joint distribution 
of the classical Kolmogoroff and von Mises statistics D, and w;, , which are de- 
fined precisely in Section 6). 


3. Tests using quantiles. The relative ease of computing the limiting dis- 
tributions of various possible test criteria of the type considered in this paper 
will of course depend on the particular problem. Thus, the use of the sample 
mean and variance in non-normal cases may lead to more complicated results 
than those of Section 2. A tool which may be used in all cases (where the hy- 
pothesized family has finitely many natural parameters, is normal or not, where 
a simple sufficient statistic does or does not exist, etc.) of density functions, with 
about equal complexity in all cases, is the use of sample quantiles (as many 
as necessary) to estimate the “true” d.f. if the null hypothesis is true. (Of course, 
the more unknown parameters and hence quantiles which must be used, the 
messier will be the result.) For computational reasons, tests constructed in this 
manner will sometimes be more practical to use than those involving a sufficient 
statistic. The results on power in Section 5 apply also here. 

As an example, suppose the pth sample quantile U,,,,0 < p < 1, is to be used 
to estimate the corresponding population parameter of a family of d.f.’s F(a — @) 
for —« < @ < o, in testing whether or not the “true” d.f. is a member of this 
family. We suppose without loss of generality that F(0) = p, and we denote by 
f the density function of F. We assume f to be continuous and positive in a 
neighborhood of 0. 

Then letting y = F and Z,., = VnlF.(¥(r) + U,..) — 1), an argument 
like that of Sections 2.3 and 2.4 leads easily to the conclusion that, for 0 S$ 
ry S +++ S ~% S 1, the limiting distribution of Z,,,,, 1 S ¢ S k, is the same 
as that of 


V/nlU pn rr [Us,. oe v(r;)]] /v'(ra), 


and, in particular, is Gaussian. Putting y(r) = f(p(r)) and 


we obtain for the limiting covariance function, for 0 S s,t S 1, 


. i p(l — p) _ min (p, s) — ps 
Kt, 0 = v0) vo | MO); 70) ¥@) 


(0) y(t) v(s) y(t) 


aa | _ {min (p, 8) — ps|{min (p, t) — pil 
= g(s) g(t) ore a 


_ min (p,t) — pt , min (s,t) — J 


+ min (s, t) — st 
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(min (s, t) — st/p s,t S p, 


(s — p)(t — p) 
:—. 


0 otherwise. 


= g(s) g(t) + {min (s — p,t — p) — s,t = p, 


For computing purposes (e.g., by Monte Carlo methods), the last form of the 
covariance function is useful. Let X,(¢) and X.(¢) be Gaussian, each with the 
familiar covariance function min (s, ¢) — st, and let X bea normal random vari- 
able with mean zero and variance 1, where X, X; , and X2 are independent. De- 
fining X,(t) = X.(t) = Oif t < Ooré > 1, let 


; ~{[t ——— ,./t— 
Z(t) = g(t)X + vp x‘) +Vl—p x{P), Osts1. 
Then Z has the covariance function K(s, ¢). As an example, if F is normal with 
unit variance and p = 3, we have 


sa 1 1 
(Zt) = [te **? = min (t,1 — O)X + Va X,(2t) + Va X2(2t — 1). 


4. The rectangular distribution. In Section 4.1 we give an example where 
the minimum distance method statistic can be computed explicitly; in Section 
4.2 we comment on the limiting distribution of test criteria of the type treated 
in Section 2 in the present case. 

4.1. Let 


(0 
F(z; 0) =(x —0+ 3 


i 

2) 

+ 2; 
1 6+ 4< z, 

and let R denote the family of all such distribution functions for —-» < @< ~. 

It is desired to test the hypothesis that z,, 22, --- , 2, are independently and 

identically distributed according to some member of R. We will be concerned 

with computations when this hypothesis is true (see Section 5 for remarks on 

power which apply also here), and denote by G the true member of R (i.e., 

the distribution function of X,). G* is as defined in Section 1. Let 

Dt = sup, (Gi(x) — G(z)), D, = sup, (G(x) — Gi(x)). 

In the present example the minimum distance criterion is easily seen to be 

(4.1) 5(G% , R) = inf, sup, |G2(x) — F(a; 6)| = 4(Dt + Dj). 

The joint limiting distribution of ~/nDt and W/nD; (as n > ©) is given by 

Doob ({13], p. 403) for z,y > 0 as 

lim P{~/nDz < x, VnDt S y} 


n--2 


6 
6 


oo 
_ > (gente + eo timat(m—De a Qe trot) 


m=) 


= G(z,y), say. 
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Let g(x, y) = 0°G(x, y)/dxdy. The series in (4.2) is uniformly and absolutely 
convergent and differentiable with respect to x and y outside of any neighborhood 
of the origin. Using the fact that the mixed derivative of the expression gem: 
is 4ab(u? — 1)e~“"”? where u = (2ax + 2by)*, we obtain for the mth term of the 
series for fo g(y, z — y) dy the expression 


4m(m — 1)[H(2mz) — H(2(m — 1)z)] + 4m(4m’2 — 1)H(2mz), 
(4.3) 


H(z) = / (ue — 1)e" du = ze*"”. 


The density function corresponding to the limiting distribution function of 
B, = Vn(Dt + D7) is a sum of expressions given in (4.3). For ~/né(G*, R) = 
3B, = U, say, the expression corresponding to (4.3) is 


(4.4) 8m(m — 1)[H(4mu) — H(4(m — 1)u)] + 8m(16m2u? — 1)H(4mu). 


The series is absolutely convergent and in the sum of (4.4) from 1 to ©, the 
coefficient of the expression H(4ku) for k = 0 is 8k(16k’u> — 3), so that the 
density function of the limiting cumulative distribution function of U is 


(4.5) 32u >) m(16m'u? — 3)e°"™. 

m=1 
Outside any given neighborhood of the origin, all terms of (4.5) are positive 
except for a finite number. Thus, integrating (4.5) we obtain for u > 0 


(4.6) lim P{+/ns(G*, R) < u} = 1 — DY (32m? — 2)". 
n—00 m=1 
It is also of interest that in this example we can compute the limiting distri- 
bution of the minimum distance estimator of 6, namely, the random variable 
T, = T,(t1, «++ , tn) defined by 


(4.7) 8(GR(y), F(y; T.)) = 6(G2, R), 


which is satisfied by T,, — 6 = 3(D; — Dt) when @ is the true parameter value. 
An analysis similar to that given above shows that, for ¢ > 0, 


(4.8) im Piv/aiT. — 0 $e 1-9 Dh. 

n—00 m=1 4m? — | 
Of course, in this simple parametric example there are estimators of order 1/n 
in probability; in estimation problems, the minimum distance method is most 
useful in examples of a more nonparametric nature, where it often yields con- 
sistent estimators when other methods do not. 

4.2. It is interesting to note that in the example of Section 4.1 and other similar 
cases it is simple to design along the lines of Section 2 a test of whether or not 
the unknown d.f. belongs to the specified class, where the limiting distribution 
of the test criterion when the null hypothesis is true is already known. These are 
the so-called “‘irregular’’ cases of estimation where an estimator of the unknown 
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parameter(s) indexing the class exists whose deviation from the true parameter 
value is of lower order in probability than the usual 1/+/n encountered in “regu- 
lar’ cases; for example, of order 1/n for the rectangular distribution with un- 
known location and/or range, or for the exponential distribution with known 
scale but unknown location parameter. For the sake of definiteness, we fix our 
attention on a Kolmogoroff-type criterion for the latter example, although the 
result applies equally to other distributions and other criteria (w’-type tests with 
different ‘‘weight’’-functions, etc). Thus, the problem is to test, on the basis of 
n observations, whether or not the true d.f. is of the form 


0,2<6 1-e€°", «28, 


for some real 6. Let T, = min(z, --- , z,). Then P,{lim,.../n(T, — 0) = 
0} = 1. Hence, if we compare the sample d.f. with the exponential c.d.f. as 
estimated by using 7’, , by computing 


B, = Vn sup |F, (1. + log -1.) —T1r, 
0<r<l l-r 

we may conclude that, when the null hypothesis is true, B, has the same limiting 

distribution as the Kolmogoroff statistic. 

Remarks on power like those of Section 5 apply also to the present case. The 
present remarks may also be modified to apply to situations where one but not 
all parameters have irregular estimators, for example, for the case of the ex- 
ponential distribution with unknown location and scale. For the case of the rec- 
tangular distribution with unknown range studied in Section 4.1, it seems in- 
tuitively reasonable that a test constructed in the manner of the present section 
may be more powerful than the one considered there. 


5. Asymptotic power of the tests of normality. The results of this section are 
carried out for the tests of normality mentioned in Section 2, but the remarks 
below concerning v, may also be carried through for the minimum distance 
test, the test of Section 3, and in many other examples, and the remarks con- 
cerning v, and w, may be extended to many other “distance” criteria. 

First we consider the test of size a based on v, . The critical region is of the 
form {v, > b(a)/+/n}, where b(a) is a constant, except for terms of lower order 
in n. Suppose G(x) = Ro(x), and that 6(Ry , N**) = d/+/n. From the theorem 
of Kolmogoroff [4] we have that 5(Ry , G*) is of the order 1/+/n in probability 
(uniformly in Ro). Hence, for 0 < 8 < 1 there is a number d* = d*(a, 8) such 
that d > d* implies that 5(G% , N**) > b(a)/+/n with probability 2 1 — 8. 
From the definition of 6(G* , N**) we have 


5(G* , N(y | #, s°*)) = 8(G% , N**). 


Thus, if we are using the test of size a, the power is at least 1 — @ for any al- 
ternative Ry whose distance from N** is = d*(a, 8)/+/n, and the power of the 
test at Ry approaches one as +/né(Ro , N**) increases indefinitely. This is a re- 
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sult of the same order as obtains for testing goodness of fit of a simple hypothesis 
by use of Kolmogoroff’s distribution (Section 1). 
We now consider the “‘w"-type” test criterion w, .° We consider the function 


(5.1) oF, H) = { [(w@ - aera (Fe) + eer)" 


as a possible measure of discrepancy between two d.f.’s. This measure has been 
used by Lehmann [14] and others. (We remark that y is not a metric, since it 
does not satisfy the triangle inequality. Another undesirable property of 7 is 
that the discrepancy between the d.f.’s of two random variables X and Y may 
not be the same as that between the d.f.’s of —X and —Y. Also, the failure of 
the formula for integration by parts in expressions like (5.1) necessitates slight 
complications, for example, in the second following paragraph. Neither of the 
last two difficulties is present if both d.f.’s are continuous. If the d.f.’s have 
jumps which are nowhere dense, one could eliminate these last difficulties by 
redefining y, for example, by replacing each jump by a constant density over an 
interval of width e about the jump and letting « — 0 after integrating. The de- 
velopment which follows would not be materially altered by such a change in 
the definition of +.) 
If F and H are continuous, 


[we - Her ar@ - He)! =o, 


and the integration in (5.1) may be carried out with respect to either F or H 
instead of 4(F + H). Hence, if G(x) = Ro(x) is continuous, 


ui? ={ [ 1G%@) - N@lz,8)F a@l2,8)} 


{ / [Ro(x) — N(x| 2, s*))’ d, N(x | 2, a} 


—{ J theta) - Grey ate |2,8)}" 
= x(Ro, N**) i 5(Ro, G*), 


where 


y(Ro, N**) = inf y(Ro, N**). 
Nen*e* 


Now, 5(Ro, G2) is of order 1/+/n in probability (uniformly in Ro) and the critical 
region based on w, is of the form {+/w, > c(a)/+/n}, where c(a) is constant 
except for terms of lower order in n. Hence, using (5.2), an argument like that 

5 In an unpublished manuscript, T. W. Anderson considers similar criteria for testing a 


simple hypothesis, and obtains similar results on asymptotic power. See also his abstract 
(Ann. Math. Stat., Vol. 25 (1954), p. 174). 
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of the previous paragraph shows that there is a value d’(a, 8) such that the test 
of size a has power = 1 — 8 for any continuous alternative Ro for which 
y(Ro, N**) = d’(a, B)/V/n. 

We now consider what happens if Ry is not continuous. It is easy to show, by 
consideration of the contribution to the integral of (5.1) at discontinuities of H, 
that, if F is continuous, 


[@-m*ar= [ -m an, 


[@ — H)' dF = af@- wy a(Pt#), 


9 


~ 


Hence, if Ry is not continuous, the argument of the previous paragraph need 
only be altered by inserting the factor ~/2/5 before y in the last expression of 
(5.2). The ensuing discussion of power then proceeds as before. 

It is interesting to compare the measurements of distance 6 and vy. Clearly, 
y(F, G) = 6(F, G) for all F,G. On the other hand, if there is a value x» for which 
F(x) — G(xo) = 4, it is clear from the monotonicity of F and G, using (5.1), 
that [y(F, G)|’ = 4{6(F, G)]*. Thus, we have 


(5.3) Vi/6e? <7 4, 


where both equalities are attainable. We conclude that, whereas for any given 
8 > O, the power exceeds 1 — 8 for alternatives whose distance from N** is of 
order 1/+/n either according to 6 for the test based on v,, or according to y for the 
test based on w, , the distance in terms of 6 must be of order 1/~/n (in the worst 
case) to insure this for the latter test. For the former test, y-distance of order 
1/-/n suffices. 

We next verify the property of the x’-test relative to y which was stated at the 
end of the third paragraph of the introduction. For brevity we shall use the nota- 
tion of [3] without redefining symbols here; the reader may also refer to [3] for 
details of the argument which we omit. Suppose then that we are testing the 
hypothesis G(x) = R(x) = x for 0 < x S 1 by means of the x’-test based on N 
observations and ky intervals of equal length on the unit interval. If now 


Gta) = Fay) = XA ve (22+) 


(see Section 1 for the definition of y,), then y(Fiy, R) = 1/kw V6. Since Fix 
assigns the same probability as R to each of the ky intervals, the power of the 
test against the alternative F,, is just the size of the test (assumed to be < }). 
We conclude that if the test gives power 2 } for all alternatives R* satisfying 
7(R*, R) = Tw, then we must have 


(5.4) kw > 1/TwV6. 
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(This is not the best possible inequality, but it suffices for our proof.) Consider 
now the distribution function 


{a + 2a)z, 0s2 38 }, 


(5.5) H,(z) = 
\2a + (1 — 2a)z, a 


’ 


where 0 < a < }. 

A simple computation shows that y(H., R) = a/+/3 = ya, say, and that 
(assuming for simplicity that ky is even) when G = H, we have > p; = 
(1 + 4a’)/ky = (1 + 12y3)/kw . Hence, the function y of [3] is given, when G = 
H,, by 


(5.6) o'(ky) = 12(N — 1) y2 — Cr/2(ky — 1). 


In order that the x’-test based on ky intervals have power = } for all alternatives 
R* with y(R*, R) = Tw, it is necessary that the expression (5.6) be 20 asympto- 
matically when we put y. = 'y, and that (5.4) be satisfied. We thus obtain, when 
N is large, 


(5.7) ly = Cc’n™ 


where C’ is a positive constant. From the result of [3] and the fact that y < 4, 
we see that the reverse inequality to (5.7) is (for a different C’) also true. Thus, 
N~*" is indeed the smallest order of 'y which will give appreciable power for all 
alternatives R* with 6(R, R*) = Ty. 

We shall now summarize the results proved thus far in this section. It is not 
known how the power function of the x’-test for composite hypotheses behaves, 
but it is plausible that the power function when testing a composite hypothesis 
by means of the x’-test (in any of its variations) is no better (in the sense we have 
used in measuring the goodness of a power function) than when testing a simple 
hypothesis. We have shown that if I and A are small and the x’-test of size < } 
of a simple hypothesis G = R requires N observations to insure power 2 3 at al- 
ternatives R* for which y(R*, R) = T (or 8(R*, R) = A), so that N = C,r*” 
(or N = C.,A~°”), then the numbers of observations required by the Kolmogoroff 
and w tests to achieve the same minimum power at T (or A) are at most 


Kolmogoroff : n = C,N** = Or” (n = C,N**® = C,A~”) 
w! n = C,N** = C,r™ (n = C,N%*® = Cyd). 


The numbers of observations n required by the tests based on v, and w, in testing com- 
posite hypotheses about parametric families are the same functions of T or A as for 
the Kolmogoroff and w’ tests of simple hypotheses. The test based on v, may thus 
be expected to be superior to the x’-test in the sense of both y and 6, and that 
based on w, may be expected to be superior in the sense of y, at least for large N. 
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It might be supposed that the x’-test would show up to better advantage rela- 
tive to the test based on v, in terms of a metric like 


n(Ro, N**) = inf | d|Ro — NI. 
Nenes 

However, this is not so; for fixed n, even if n(Ry , N**) is near its maximum of 2, 
neither the best x’-test of [3] nor any of the other tests we have mentioned need 
have appreciable minimum power (2 4), and 6(Ry, N**) and y(Ry, N**) can 
be arbitrarily small. In fact, it is easy to see that no test can have the infimum 
of its power function over all alternatives Ry with n(Ry , N**) = C > 0 greater 
than the size of the test. (If N** were a simple hypothesis, this would still be 
true.) In order better to compare the behavior of tests in terms of the metric 7, 
we might therefore restrict our consideration to alternatives Ry belonging to 
some regular class, for example, the class of d.f.’s with densities which cross that 
of each member of N** at most M times. Under such a comparison the test based 
on v, may be shown to be superior to that based on the x*-test, in the same sense 
as under our previous comparison. 

The discussion of this section suggests very strongly that there is a “natural” 
distance with respect to which the power characteristics of a particular ‘‘distance”’ 
test criterion should be measured, and that a comparison of the power of such 
tests in terms of their own and other distances indicates that tests correspond- 
ing to strong metrics have the best global power characteristics. It is hoped to 
investigate this idea further. 


6. Numerical results. In this section we list some pertinent experimental and 
computational results. The main purpose of the sampling experiments was to 
obtain estimates of the distributions of nw, and +/nv, which may be used in ap- 
plications to test normality. As a check on the sampling experiments the same 
data were used to compute experimentally the d.f. of Z and the d.f. of ~/nD, 
with n large, D, being defined by 


D, = 8(L(x), L*(2)) 


where L*(x) is the empiric d.f. of n independent chance variables with the df. 
L(x). Also, as another check on the experimentally obtained d.f. of nw, , an 
approximation to the d.f. of W was computed, using (2.9). 

Define 


ds / (L(x) — L*(z))? dL (a). 


The sampling experiments were conducted using 400 samples of size n = 100 
and 400 samples of size n = 25, of random standard (mean zero, variance one) 
normal deviates from the well-known Rand Corporation series. From each sam- 
ple the values of the sample mean (#), sample variance (s°), v,, w,, D, , and 
w,, were computed. Thus, there were obtained 400 “observations” on +/7nv, 
and nw, for n = 25 and n = 100, and the sample d.f.’s based on these observa- 
tions serve as estimates of the d.f.’s of +/nv, and nw, . The known df. of 
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and the known d.f. of ~/nD, were compared with the experimentally obtained 
d.f.’s of € and ~/nD,, as a check on the experimentally obtained d.f.’s of ~/nv, 
and nw, . 

It was found that the experimentally obtained d.f. of Z agreed well with the 
known d.f., and the experimentally obtained d.f. of ~/nD, agreed well with the 
d.f. of ~/nD, as tabulated in [15]. In each case, the maximum difference between 
the experimentally obtained and the known actual d.f. was found to be small 
according to the tables [15]; this is the basis for saying the agreement was good. 
The agreement between the limiting df. of nw%, (tabulated in [16]) and the ex- 
perimentally obtained d.f. was found to be fairly close for n = 100 but not close 
for n = 25; the upper tails of the distributions seem to be the parts which come 
into agreement most rapidly with n. The distributions of +/nv, and nw, are 
concentrated closer to the origin than those of ~/nD, and nw>,, respectively. 
This is not entirely surprising since the covariance function (2.1) is smaller than 
that for the process corresponding to D,, and w;, (namely, min (s, 1) — st). Scatter 
diagrams of the sample values of (1/nD, , nw) indicate that the regression of 
nw, on ~/nD, is roughly parabolic with the conditional variance of nw., increas- 
ing with the value of +~/nD, ; this is not entirely surprising in view of the way in 
which w’, and D, are computed from a sample. Although v, and w, are correlated 
with D, and w;, , they seem to be more strongly related to Z. 

In Tables I and II, respectively, are given the estimates of the d.f.’s of ~/nv» 
and nw, ; commonly used percentage points are listed for convenience in Table 
III. 

We now turn to the limiting distribution of nw, . The first eight zeros of 
D(u) are alternately zeros of D;(u) and D.(u). Their values, obtained by numer- 
ical computation of these functions for various values of yu, are 7.38, 8.62, 13.66, 
15.14, 19.91, 21.52, 26.16, 27.87. The corresponding values of \; for 1 £7 3 8 
are .01836, .01346, .00536, 00436, .00252, .00216, .00146, .00129. It is to be noted 
that d2;-1 and 2; are both approximately c/;’ (the corresponding property for 
the eigenvalues arising in the computation of nw, is that \; = 1/2°j*). This 
(inferred) speed of convergence of the A; to 0 implies that the distribution of 
W* = >A \,;v; should be a fairly good approximation to that of W = Draws, 


TABLE I 
Estimate of Q,(z) = P{/nv, S x}. 








Qas(x) | Qro0(x) 
| 


* 


Qes(x) Qioo(x) 


0 | 0 Gy | $425 


-0125 -0025 
-0250 

.0800 

-1975 

3300 

4775 

-6750 

7325 


9025 
9350 
- 9600 
.9775 
9900 
9925 
.9975 

1.0000 


Seenea 


° 
a 





i 
o 


_ 
| on 
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TABLE II 
Estimate of R,(z) = P{nw, S x} and distribution function H(z) = P{W* Ss z}. 
x Res(x) | Rioo(x) H(x) 





.01 0 .108 
.02 ] ; .297 
.03 | : 471 
.04 ; : .609 
05 AE ; .712 


.06 . 582! é .787 
.07 j F 843 
.08 .725 ; .883 
.09 | 7 : .914 
.10 a | ; .937 


ll ' .953 
12 .9150 . 966 
13 9425 s 975 
14 9525 982 
15 .9650 | .9850 .987 


16 \ .9700 .9850 .997 
17 | -9800 .9875 
18 | .9875 .9875 
.19 .9875 . 9900 
.20 .9900 .9950 


21 | 9950 9950 
22 | 9975 9975 
23 | 1.0000 1.0000 








TABLE III 
Estimates of common percentage points of Q, and R,. 


| | 
Qu-(p) Qioo™!(p) | Res (p) Rioo (p 


7435 .729 .0909 0824 
8225 .797 1145 1019 
.8980 .878 1352 1240 


o2 | 9685 954 1671 1386 
.O1 1.0145 .989 .1957 . 1859 
.005 1.0465 1.062 . 2053 .1957 
and the distribution of W* was therefore computed, using the method of [17] 
(here \ vy; is the R; of (2.9)). The results are given in the last column of Table IT. 

From the fact noted above regarding the speed of convergence of the d.f. of 
nw, , and the fact that Table II indicates (in the difference Rioo(x) — Res(x)) a 
much slower approach to the limiting distribution for R,(x) than that noted in 
our experiment for the w,-distribution, we would expect that the d.f. H(z) of 
W* should lie above the estimates R,(x) of Table II, being close to Rioo(x) only 
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in the upper tail. This is what the last column of Table IT actually shows, and an 
idea of how good the agreement is in the tail may be obtained by computing 
M(x) = 20(H(x)\""[H(xz) — Rro(x)]; for = .13, .14, .15, .20, one obtains 
M(x) < 1, which indicates very good agreement. Thus, in applications where 
n is large, it seems reasonable to use the last column of Table II, especially in 
the upper tail (which is the region that matters for statistical tests). 

We are indebted to Prof. R. J. Walker and to Mr. R. C. Lesser of the Cornell 
Computing Center for carrying out the sampling and the computation of the 


\;’s. The experimental data are available in the files of the Cornell Computing 
Center. 
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termines a... .”’’ 





SOME CLASSES OF PARTIALLY BALANCED DESIGNS! 


By R. C. Bosse anp W. H. Ciatrwortuy? 
University of North Carolina 


1. Summary. Incomplete block designs with a few replications are of practical 
interest to experimenters. Partially balanced incomplete block (PBIB) designs 
with two associate classes, and k > r = 2, were studied by one of the authors 
[1]. The present paper extends this investigation to the case k > r with \, = 1 
and A: = 0. It is shown that the parameters of all PBIB designs in this case 
are given by (4.27) and thus depend upon three integral parameters k, r, and f, 
with the additional restrictions that 

(i) l Stsr, 

(ii) rk(r — 1)(k — 1)/t(k + r — t — 1) is a positive integer. 

For the particular case r = 3 it is shown that all designs with ¢ = 2 or 3 
necessarily exist, but if ¢ = 1, then the only possible value of k > r is 5. How- 
ever designs with parameters (4.27) with r = 3, ¢ = 1, and k = 2 or 3 are 
also combinatorially possible though they do not belong to the class k > r. 

Interesting by-products of this study are a lemma and five corollaries which 
give an insight into the structure of PBIB designs with \, = 1 and X» = 0, 
and Pi = k — 2, no special assumptions being made regarding r and k. 


2. Introduction. PBIB designs with m associate classes (m = 1) were intro- 
duced by Bose and Nair [2]. Balanced incomplete block designs and square 
lattices were included as special cases. Nair and Rao [5] broadened the definition 
so as to further include cubic and other higher dimensional lattices. Bose and 
Shimamoto [3] have rephrased the definition so as to stress the fact that the 
relations between the treatments are determined only by the parameters n; and 
pix with 7, 7, k = 1, 2, ---, m. For the special case of two associate classes 
(m = 2) the Bose and Shimamoto definition is substantially as follows. 

A PBIB block design with two associate classes is an arrangement of v treat- 
ments (or varieties) in b blocks such that: 

(i) Each of the v treatments is replicated r times in b blocks each of size k, 
and no treatment appears more than once in any block. 

(ii) There exists a relationship of association between every pair of the v 
treatments satisfying the following conditions: 

(a) Any two treatments are either first or second associates. 

(b) Each treatment has n, first and nz second associates. 

(c) Given any two treatments which are ith associates, the number pj, 
of treatments common to the jth associates of the first and the kth associates 
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of the second is independent of the pair of treatments with which we start. 
Furthermore, pj, = pi; fort, j,k = 1,2 


, a 
(iii) Any pair of treatments which are ith associates occur together in ex- 
actly \,; blocks for 7 = 1, 2. 
If either n, or nm. assumes the value zero, one associate class does not exist. 
Hence, we shall require that both nm, and nz be positive integers. 


(2.1) or = bk, 
v=nm+n+ti, 
Aim + Ame = r(k — 1), 
piu + pie +1 = pi t+ pi2 = m, 
(2.5) pu + px = pat pe tl =m, 


(26 ee 1 2 
2.6) MmpPi2 = MPu, MPo2 = NePie- 


Furthermore, it was proved that if values are assigned to the parameters of the 
first kind (v, b, r, k, Ar, Ax, Mm, and ne) satisfying (2.1), (2.2), and (2.3), then 
there is one independent parameter of the second kind (p}, for 7, j, k = 1, 2). 

The parameters of the second kind will be exhibited as elements of two sym- 
metric matrices 


1 1 2 2 
Pu Pie Pu Piz 

(2.7) Pi = ( : 7 P; = ( eee ) 
Pa P22 Pu P22 


Nair [6] established a necessary condition for PBIB designs having k > r. 
This condition was used by Bose [1] in exhausting the subclass of PBIB designs 
with two associate classes and \,; = 1 and \, = 0, with r = 2. For the special 
case of PBIB designs with two associate classes with \; = 1 and \. = 0, and 
k > r, Nair’s condition simplifies to 


(2.8) rpiz — (r — lpi = r(r — 1), 
a useful tool in the present investigation. 


3. A less demanding definition for PBIB designs with two associate classes. 
We shall now show that for PBIB designs with two associate classes the Bose 
and Shimamoto definition is more demanding than it need be. To this end we 
establish two theorems. 

THEOREM 3.1. Let there exist a relationship of association between every pair among 
the v trealments satisfying the conditions: 

(a) Any two treatments are either first or second associates. 

(b) Each treatment has n, first and ne second associates. 

(c) For any pair of treatments which are first associates, the number pj, 
of treatments common to the first associates of the first and the first associates of 
the second is independent of the pair of treatments with which we start. 
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Then, for every pair of first associates among the v treatments the numbers pis , 
px , and pr are constants, and Piz = Pu- 

Proor. Let @ and ¢ be an arbitrarily chosen pair of first associates from among 
the v treatments. Let pj, (8, ¢) denote the number of treatments common to the 
jth associates of @ and to the kth associates of ¢, for 7, k = 1, 2. The m, first 
associates of @ are made up of ¢, the pi:(@, ¢) treatments which are first asso- 


ciates of @ as well as @, and the pj.(@, ¢) treatments which are first associates 
of @ but second associates of ¢. Hence 


(3.1) l a pil, ¢) aa pir, o) = hh. 
Likewise, classifying the first associates of ¢, we have 
(3.2) 1+ pul, o) + pu(0,¢) = n. 


Similarly, the nz second associates of @ are made up of the p2(6, ¢) treatments 
which are second associates of @ and first associates of ¢, and the p2.(@, $) treat- 
ments which are second associates of both @ and @. Hence 


(3.3) pul, ) + pro(0,o) = nm. 


But by hypothesis, pil, ¢) is independent of the pair of treatments 6 and @ 
and is pj: . Hence, from (3.1), (3.2), and (3.3) we obtain 


(3.4) pi2(8, @) = pul(0,d) = m — pu — 1, 


(3.5) 22(8,¢) = m2 — m+ put. 


Since @ and ¢ are an arbitrarily chosen pair of first associates, the relations 
(3.4) and (3.5) amount to a proof of the theorem. 

TuEoREM 3.2. Let there exist a relationship of association between every pair 
among the v treatments satisfying the conditions: 

(a) Any two treatments are either first or second associates. 

(b) Each treatment has n, first and nz second associates. 

(c) For any pair of treatments which are second associates, the number pi 
of treatments common to the first associates of the first and the first associates of the 
second is independent of the pair of treatments with which we start. 

Then, for every pair of second associates among the v treatments the numbers 
Dis, Par, and pr are constants, and pi2 = pr . 

Proof is similar to that of Theorem 3.1. In fact, it can be shown that 


(3.6) Di2(9, >) pul, ¢) = m — Pi, 
(3.7) p2(8,¢) = nm — m + pun — 1, 


where @ and ¢ are an arbitrarily chosen pair of second associates. 
The question naturally arises whether one of the preceding theorems implies 
the other. The answer is no. Consider the following design w.th v = 7 treat- 
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ments in b = 14 blocks, each of size k = 2, and with each treatment replicated 
r = 4 times: 


(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (5, 6), (6, 0), 
(0, 3), (1, 4), (2, 5), (3, 6), (4, 0), (5, 1), (6, 2). 


Each treatment of this design has n, = 4 first associates and nm, = 2 second 
associates with 4; = 1 and \, = 0. For any pair of treatments @ and ¢ which 
are second associates, pii(@, ¢) = 3. Hence, this design satisfies Theorem 3.2 
with piz = po = 1 and px = 0. However, for any two treatments a and 8 
which are first associates, pi:(a, 8) = 1 or 2, and Theorem 3.1 is not satisfied. 

Insofar as PBIB designs with two associate classes are concerned, the conse- 
quence of Theorems 3.1 and 3.2 is that the definition of a PBIB design given 
by Bose and Shimamoto demands more than is needed. For PBIB designs with 
two associate classes, a less demanding definition could be formed by replacing 
condition (ce) of (ii) in Section 2 by 

(c’) For any pair of the v treatments which are ith associates, the number, 
pi: for i = 1, 2, of treatments common to the first associates of the first and the 
first associates of the second is independent of the pair of treatments with which 
we start. 

This new definition and Theorems 3.1 and 3.2 are then equivalent to the 
Bose and Shimamoto definition. Under the Bose and Shimamoto definition, 
to prove that an arrangement of objects is a PBIB design, it is necessary, in- 
sofar as (c) of (ii) is concerned, to show the constancy of all eight parameters 


pix for i, j, k = 1, 2, and also to show that the equalities pj, = pi; hold for 
j # k andi, k = 1, 2. By use of Theorems 3.1 and 3.2, with regard to the par- 
ameters of the second kind, it is necessary only to show the constancy of pi 
and pj, . 


4. Complete enumeration of PBIB designs with two associate classes and 
k>r 2 2,withd, = 1 and), = 0. We first establish the following useful lemma. 

Lemma 4.1. For any PBIB design with two associate classes and k > r 2 2, 
with A; = 1 and dX» = 0, 


(4.1) k—2 Ss pu S (k — 2) + (r — 1). 


Proor. The left portion of (4.1) is obvious; consider the right portion. Let the 
treatments @ and ¢ be first associates. Then they occur together in exactly one 
block which we shall denote by B(@, ¢). There are k — 2 other treatments in this 
block which are first associates of 6 as well as ¢. Since \; = 1, both @ and ¢ can- 
not occur together in any other block. Denote the r — 1 blocks in which @ 
but not @ occurs by B;(@), for 7 = 1, 2,---,r — 1, and similarly the r — 1 
blocks in which ¢ but not @ occurs by B,(@), for 7 = 1,2, ---,r — 1. If a treat- 
ment does not occur in B(@, @) but is a first associate of @ as well as ¢, it must 
occur exactly once in the blocks B,(@), and exactly once in the blocks B;(@). 
But the block B ,;(¢) cannot have more than one treatment in common with any 
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of the blocks B,(@), B2(@), --- , B,1(@). Hence B;(¢) cannot contain more than 
r — 1 first associates of 6. This holds for 7 = 1, 2, --- , r — 1. Therefore there 
cannot exist more than (r — 1)* treatments which occur once among the blocks 
B,() and once among the blocks B,(¢). Thus pi, cannot exceed (k — 2) + 
(r — 1)’, which proves the lemma. 

We shall now obtain the combinatorial parameters of all designs belonging to 
the class under consideration. From (2.4) and (2.6) we obtain 


(4.2) MPic + Mepis = MM. 

Solving (2.8) and (4.2) simultaneously, we have 

(4.3) Pi2 = na(r — 1)(r + m) / [m(r — 1) + ne] 
(4.4) Piz = rm(ne — r + 1) / [mr — 1) + ng). 
From (2.2) and (2.3) we get 

(4.5) m = r(k — 1) 

(4.6) v=m+1+r(k — 1). 

From (2.1) and (4.6) it follows that 

(4.7) b =r + (r/k)(m — r+ 1). 

Since both b and r* must be integral, we set 


(4.8) rine —r+1) = sk 


where s is an integer. Then, from (4.8), (4.7), (2.1), (4.3), (4.4), and (4.5) we get 
(4.9) nm, = sk/r +r — 1, 

(4.10) =r+s, 

(4.11) » = k(r’ + 8)/r, 


(4.12) = (r — 1)k — r(r — 1)°(k — 1) /[s + r(r — 1)], 
(4.13) 2 = r(k — 1) — r(r — 1k — 1) / [8 + r(r — 1)). 


From (4.12) and (4.13) it is seen that r(r — 1)°(k — 1) and r'(r — 1)(k — 1) 
must both be integral multiples of s + r(r — 1). Hence, their difference must 
also be divisible by s + r(r — 1). Therefore, we introduce an auxiliary integral 
parameter ¢ defined by 


(4.14) t= r(r — 1)(k — 1) /[s + r(r — 1)). 
Then 

(4.15) s=r(r — 1)(k —t — 1)/2, 
From (4.9) through (4.13) and (4.15) it follows that 
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(4.16) v=k(r —- D(kK- 1) 4+ 4/1, 

(4.17) b=r{(r—-1(k-1)+ 4/2 

(4.18) m = (r — 1)(k — 1)(k — t/t, 

(4.19) Pi = (r — 1)(k — 0), 

(4.20) Piz = r(k — t — 1). 

From (4.19), (4.20), (2.4), and (2.5) we get 

(4.21) pu = t(r —1) +k —r—1, 

(4.22) pn = (r — 1)(k—t) (k-—t — 1)/t, 

(4.23) pu = rt, 

(4.24) poo = [(r — 1)(k — 1)(k — 2t) + t(rt — k)I/t. 
Applying Lemma 4.1 to (4.21), we obtain 

(4.25) l<t<r, 


a most useful set of bounds on the integral parameter ¢. It is now seen that the 
divisor, s + r(r — 1), in (4.12) produces no difficulty because 


(4.26) (r— 1)(K-—1) Ss4+r(r — 1) S r(r — Ik - 1), k>re2 2. 


In summary, all PBIB designs with two associate classes belonging to the class 
characterized by k > r 2 2, with A; = 1 and 2 = 0, are obtainable from 


»= k[(r — I)(k — 1) + df/t, r=f, u = 1, 
ri(r — I(k — 1) + Aft, k = k, Ae = 0, 
r(k — 1), m = (r — 1I)(k — I(k — d/t, 
@-D)r-)+k-2 (r — 1I)(k — t) 
. ( (r—-Dik-2b (r—D)ik-O)k—-t— oon 


‘ ( rt rik —t —1) 
a rk—-t—1) [(r — Dk — Ik — 2d) + t(rt — k)I/t ’ 


where 1 St Sr. 
Connor and Clatworthy [4] have shown (Theorem 5.1) that for a PBIB design 
with two associate classes it is necessary (but not sufficient) that the quantities 


(4.28) a =[(v — 1I(— ¥ + Va + 1) — 2m) /2VA 
(4.29) a. = [(v — 1I)(y + VA +1) — 2nJ]/2 VA 


be positive integers, where 


(4.30) Y=Pe-—Px, B=Peotpe, A=7y+24+1. 
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Thus for any design of (4.27) it is necessary that 
(4.31) a = rk(r — 1)(kK —1)/tk +r—t-— 1). 


In the case of designs of three replications, there are only three series, one 
corresponding to each of the three permissible values of ¢. Setting ¢ = 1, 2, and 
3 in (4.27) we obtain 


v = k(2k — 1), r= 
3(2k — 1), 


we 3(k — 1), 


3, 

| ° 

= k, nm = 2k — 1), 
(4.32) 

| 


b k 
( -*? 2(k — 1) ( 3 3(k 
P, as . P, on ; 
\ 2(k — 1) 2(k — 1)(k — 2) 3(k — 2) 2 


m = 3(k — 1), 


6 3(k — 3) 
2(k — 2) (k — 2)(k — 3) as a — 3) k —6k+ st 
v= k(2k+1)/3, r=3, = = 3(k — 1), 
b= 2k +1, k=k 2 = 0, 2 = 2(k — 1)(k — 3)/3, 


k+2 2(k — 3) 
(4.34), P, = : 
2(k — 3) 2(k — 3)(k — 4)/3 


| i ( 9 3(k — 4) 
> \3 — 4) (2k — 17k + 39)/3/) 


In (4.34), k must be of the form 3p or 3p + 1, p being an integer. 

The designs given by (4.33) are the well known lattice designs of three repli- 
cations. For all integral values of k = 3 solutions exist. 

The results of Reiss [7], together with those of Shrikhande [8], are sufficient to 
guarantee the existence of solutions for the designs of (4.34) as duals of the 
corresponding balanced incomplete block designs given by 


(4.35) v* = 2k+1, b* = k(2k + 1)/3, k* = 3, r* = k, A* = 1, 


where k is of the form 3p or 3p + 1, p being a positive integer. 

It follows from (4.31), that for designs of (4.32), 12/(k + 1) must be integral. 
The only permissible values of k are therefore 2, 3, 5, and 11. 

The designs corresponding to k = 2 and 3 are known, the design corresponding 
to k = 5 is new, and the case k = 11 is impossible. We shall defer the discussion 
of these designs to Section 7. 


5. The block structure of PBIB designs with two associate classes having 
\1 = l and \» = 0, with Pir = k — 2. First, we shall introduce the notation and 
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terminology used throughout the remainder of this paper. We shall then estab- 
lish a lemma and five corollaries which provide necessary conditions for the 
existence of the designs we consider. 

Consider any PBIB design with two associate classes, and having \; = 1 and 
d» = 0, with pi: = k — 2. Let the Greek letter @ represent any treatment of the 
design. The n, = r(k — 1) first associates of 6 will be denoted by r Latin letters 
each bearing subscripts 1, 2,--- ,k — 1. The k — 1 first associates of 6 appear- 
ing together with @ in a block will be denoted by the same letter, and first associ- 
ates of @ appearing in different blocks with @ will be denoted by different letters. 
The nz second associates of @ wili be denoted by the integers 1, 2,---, me. 
The n, first associates of 6 will sometimes be referred to as lettered treatments and 
the n, second associates of @ as numbered treatments. The r blocks containing 
treatment @ will be referred to as the 6-blocks. 

Likewise, the blocks containing the lettered treatments x;, where zt = a 
or b or ¢, ete., andj = 1, 2,---,k — 1, but not @ will be referred to as the 
x-blocks, and the r blocks containing the numbered treatment 7, with 1 S 7S 
Nz , Will be called the 2-blocks. Blocks containing only first associates of 6 or only 
second associates of 6 will be called pure blocks while blocks containing both 
first and second associates of @ will be called mixed blocks. 

In the special case of designs having \; = 1 and \, = 0, with py = k — 2, 
it will be shown that there are exactly r(r — 1)(k — 1) mixed blocks each of 
which contains one lettered treatment and k — 1 numbered treatments. Those 
(r — 1)(k — 1) mixed blocks containing first associates of @ denoted by the same 
letter (and subscripts 1, 2, --- , k — 1) will be called a group of blocks. There are 
r such groups of blocks. The group of mixed blocks containing the lettered treat- 
ment x with any subscript will be referred to as the z-group of blocks (x = a, 
b, c, ete.) Within the z-group of blocks there are r — 1 blocks containing the 
lettered treatment x;. These r — 1 blocks will be referred to as the 2;-set of 
blocks. 

Lemma 5.1. For a PBIB design with two associate classes having \, = 1 and 
ho = 0, with pi. = k — 2, any two treatments appearing in different blocks con- 
taining a common treatment must be second associates of each other. 

Proor. There is no loss in generality if the treatment common to the two 
blocks is taken as treatment @. Let x; and y;, with z # yandi,j = 1,2,---, 
k — 1, be any two treatments appearing in different blocks containing @. Since 
Ai = 1 and , = 0, two treatments appearing in the same block are first associ- 
ates of each other while any pair of treatments which do not occur together in 
any block are second associates of each other. The k — 2 other treatments 
appearing in the same block with the pair (@, x;) are first associates of both @ 
and x; . Since pi, = k — 2, all other first associates of 6 (including y;) must be 
second associates of x; . This proves the lemma. 

Coro.uary 5.1. Under the conditions of Lemma 5.1, each mized block contains 
only one lettered treatment, and the r(r — 1)(k — 1) mixed blocks can be divided into 


r distinct groups, each group containing k — 1 sets and each set containing r — 1 
blocks. 
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Proor. Let the r 6-blocks of the design be 


6 ay, 
6 by 


6 hy ly 


Since A; = 1 and A, = 0, no pair of treatments can occur together in more than 
one block. By Lemma 5.1 any pair of lettered treatments 2; and y; , with z ¥ y, 
must be second associates of each other. Hence, no two lettered treatments can 
appear together in a block free from 6. Therefore the design must contain r(r — 1) 
-(k — 1) mixed blocks, each of which contains one lettered and k — 1 numbered 
treatments. Since each lettered treatment must appear in r — 1 blocks free from 
6, there are (r — 1)(k — 1) mixed blocks containing treatments x7; , 22, --- , 
2-1 . These blocks constitute the z-group of blocks, where z = a, b, --- , or l, 
and there are r distinct groups of blocks. Within the x-group there are r — 1 
blocks containing the lettered treatment x; , and these we have called the x;-set 
of blocks for 7 = 1, 2,---,k — 1. Each group of blocks obviously contains 
k — 1 distinct sets of blocks. 

Corouuary 5.2. Under the conditions of Lemma 5.1, the numbered treatments 
appearing in a group of blocks must all be distinct. Hence nz 2 (r — 1)(k — 1)’. 
If n. = (r — 1)(k — 1)*, each growp must contain precisely one complete replica- 
tion of the numbered treatments. 

Proor. Since \; = 1 and A, = 0, no numbered treatment can appear in two 
blocks belonging to the same set. Suppose the same numbered treatment j/, 
for 7 = 1, 2, +--+, m2, appears in blocks belonging to different sets of the same 
group. Let the lettered treatment in one of the sets containing j be z,, and the 
lettered treatment in the other set containing j be x, , with m # n and m,n = 
1, 2,---,k — 1. Now z,, and z, are first associates of each other since they 
appear together in the same 6-block. But by Lemma 5.1, z, and z,, must be 
second associates of each other since they appear in different blocks containing 
the common treatment j, a contradiction. Therefore no numbered treatment can 
appéar in two or more blocks belonging to the same group. Hence the numbered 
treatments appearing in the same group of blocks must all be distinct. Since each 
mixed block contains k — 1 numbered treatments, (r — 1)(k — 1) distinct 
numbered treatments appear in a group of blocks. Since numbered treatments 
are second associates of 0, mn. = (r — 1)(k — 1). If m = (r — I)(k -— 1)’, 
each group must obviously contain exactly one complete replication of the 
numbered treatments. 

Coro.uary 5.3. Under the conditions of Lemma 5.1, the mixed blocks contain 
pi complete replications of the numbered treatments. The pure blocks must contain 
r — pi complete replications of the numbered treatments. 

Proor. Consider treatment @ and an arbitrarily chosen numbered treatment 
i, fori = 1, 2, --- , m. Since @ and 7 are second associates of each other, and 
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since @ has only lettered treatments as first associates, treatment i must have 
exactly pi: lettered first associates. Since lettered treatments appear only in 
6-blocks and mixed blocks, and since no numbered treatments can appear in a 
6-block, it follows by Corollary 5.1 that treatment 7 must appear in exactly 
pi mixed blocks. This is true for i = 1, 2, --- , ne ; therefore the mixed blocks 
contain exactly pj, complete replications of the numbered treatments. Now, 
there are, in all, r complete replications of the numbered treatments. Therefore 
the pure blocks consist of r — pi; complete replications of the numbered treat- 
ments. 

Coro.uary 5.4. Under the conditions of Lemma 5.1, two sets of blocks belonging 
to different groups must intersect in pi: — 1 numbered treatments; that is, they 
must have pi: — 1 numbered treatments in common. 

Proor. By Lemma 5.1 and Corollary 5.1, the pair of lettered treatments 2; 
and y;, where « ¥ y, appearing in two sets belonging to different groups are 
second associates of each other. Since x; and y; have 6 common to their first 
associates but no common lettered first associates, they must have pj; — 1 
common numbered first associates. Since z; and y; appear only in the 6-blocks 
and in the sets under consideration, these two sets must contain the pj, — 1 
common numbered first associates of x; and y;. This proves the corollary. 

SOROLLARY 5.5. Under the conditions of Lemma 5.1 a mixed block cannot in- 
tersect a set belonging to another group in more than one treatment. If ne = (r — 1) 
-(k — 1)’, then each mixed block must intersect each set of another group in exactly 
one numbered treatment. 

Proor. Suppose a mixed block intersects a set of another group in two or more 
treatments. Then consider a pair of numbered treatments common to the given 
mixed block and the intersected set of blocks belonging to another group. These 
two numbered treatments are first associates of each other since they occur to- 
gether in a block. Their common first associates will include the k — 2 other 
treatments in the block containing them both and also the lettered treatment 
occurring in all blocks of the set intersected. Hence pi: = k — 1, in contradiction 
to the hypothesis. This proves the first part of the corollary. 

Sach group must contain a complete replication of numbered treatments 
when nz. = (r — 1)(k — 1)*, and we have proved that a mixed block cannot in- 
tersect a set of another group in more than one treatment. Therefore, when 
no = (r — 1)(k — 1)’, the k — 1 numbered treatments in a mixed block must be 
distributed one to each set in the other groups. This proves the second part of the 
corollary. 

DerFInition. Two number-pairs may be defined to be set compatible if it is 


possible for them to occur in the same set without violating Lemma 5.1 and its 
corollaries. 


6. The relationship of duality between members of a certain two-parameter 
series of PBIB designs. Let D be a design with a known solution. To form a 
design D*, let the treatments of D be the blocks of D*, and the blocks of D 
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be the treatments of D*. The design D* has been called the dual of design D. 
The process described is sometimes called inverting or dualizing the design D. 
In general, the dual of a PBIB design is not itself a PBIB design. Inversion of a 
partially balanced design D can result in a dual design D* having a different 
number of associate classes than design D. Inversion of a PBIB design D may 
yield a balanced incomplete block design. 
Consider the two-parameter series of designs having 
v=K(r—1(k-D+1), r=r, s =1, r(k — 1), 
| barr -—Dk-D+, k=k, w= 2 = (r — 1)(k — 1)’, 


( k-—2 (r — 1)(k — 1) 
(6.1) < P, = c 
| r-Ik-1) (-DE-VE- ») 


| ( r r(k — 2) 
P, = ‘ ’ 
rik — 2) k(r — 1) — Gr — 2k — ») 


obtained by setting ¢ = 1 in (4.27). Note that (6.1) satisfies the conditions of 
Lemma 5.1 and its five corollaries, with equality holding in Corollary 5.2. 

Assuming the existence of a solution of some design belonging to (6.1), let us 
examine its dual. Let the parameters of the dual be distinguished by the asterisk 
as superscript. Clearly 


v* r{(r hai 1)(k cae 1) + 1}, r= k, 
(6.2) 


b* = k[{(r — 1)(k — 1) + 1], k* =r, 


The r blocks of (6.1) containing treatment @ go over into r treatments appear- 
ing in a single block of the dual design. Since no pair of 6-blocks of (6.1) contain 
another common treatment, in the dual no treatment-pair corresponding to a 
pair of blocks containing @ can occur together in a block (excepting the one 
block corresponding to treatment @). Hence 


(6.3) t=1, A =0. 


Arbitrarily choose a block of the original design and a treatment within this 
block. There are r — 1 other blocks containing this treatment. The arbitrarily 
chosen treatment may be called @ and the r blocks containing @ are then the 6- 
blocks. The arbitrarily chosen 6-block intersects each of the other r — 1 6- 
blocks in exactly one treatment. It also intersects each of the (r — 1)(k — 1) 
blocks of one group in a single treatment, but it intersects no other blocks. Hence 
in the dual 


(6.4) ni = k(r — 1). 


These nj blocks in the original design (treatments in the dual) we shall call the 
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first associates of the arbitrarily chosen block. Since the arbitrarily chosen block 
fails to intersect any of the blocks in the other r — 1 groups, in the dual 


(6.5) ns = (k — 1)(r — 1)’. 


We shall call these n} blocks the second associates of the arbitrarily chosen 
block. 

Now consider any pair of blocks which are first associates. There is no loss in 
generality in assuming that they are the ith and jth blocks containing treatment 
6 for i ¥ j; with 1 S tandj Sr. The first associates of the ith block containing 
6 consist of all @-blocks except the ith, and of all the blocks in the ith group 
(the ith group being the one whose lettered treatments occur in the ith 6-block). 
The first associates of the jth block containing @ consist of all 6-blocks except the 
jth, and of all blocks in the jth group. The blocks which are common to the first 
associates of the ith and jth @-blocks are the other r — 2 6-blocks. Hence, 


(6.6) Pit =r— 2, 


a constant for any particular design. Since the conditions of Theorem 3.1 are 
satisfied, it follows that pir, Par , and por are constants, and from (3.4) and 
(3.5) we obtain 


(6.7) piz = pu = (r — 1)(k — 1), 
(6.8) po = (r — 1)(r — 2)(k — 1). 


Next consider a pair of blocks which are second associates of each other, that 
is, a pair of blocks containing no common treatment. There is no loss in gen- 
erality if we take one of them to be the first @-block and the other to be a block 
in the jth group, where 7 ~ 1. The first associates of the first 6-block are the 
r — 1 other 6-blocks and all the blocks in the first group. The first associates of a 
block in the jth group consist of the r — 2 other blocks in the same set of the 
jth group, one block (not the first) among the 6-blocks, and k — 1 blocks in 
each of the r groups excepting the jth. Hence 


(6.9) pu =k, 


a constant for a particular design. Thus it is seen that the conditions of Theorem 
3.2 are satisfied, and hence, piz , p21, and p22 are constants. Furthermore, from 
(3.6) and (3.7) we obtain 


(6.10) Dis = pu = k(r — 2), 


(6.11) pe = r(k — 1) — (8k — 2)(r — 1). 


In the original design, any pair of blocks which are first associates interesect in 
a single treatment, so At = 1, while any pair of blocks which are second associ- 
ates fail to intersect at all, so A = 0. 
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We have shown that the conditions (i), (ii), and (iii) stated in the definition 
of a PBIB design with two associate classes are satisfied. Hence, the dual of 
design (6.1) is a PBIB design with two associate classes having parameters 

v* =r[(r—D(k—1) +1), r* =k, AT =1, nt = k(r —1), 


b* = Kir — 1)(k —1) +1], k* =r, AE =0, nb = k—DG—-D* 


r—2 (r — 1)(k — 1) 


| 
(6.12){ PT = ( 


(r—1)(k-—1) (r — 1)(r — 2)(k — 1) 


we ( k k(r — 2) ) 
" \b@ — 2) #0 = 2) ~- (0k — 2) — 1)/ 


If in (6.12) we replace k with r* and r with k*, it is seen that the dual has the 
same form as the original design (6.1) and hence belongs to the same series. 
We have proved that the existence of the original design (6.1) implies the ex- 
istence of (6.12) as the dual of (6.1). If we now assume the existence of (6.12), 
it can be shown by exactly the same type of argument that its dual exists and is 
given by (6.1). We have thus proved the following theorem. 

THEOREM 6.1. Between corresponding designs of (6.1) and (6.12) there exists a 
relationship of duality. They exist or fail to exist simultaneously. 

The theorem of duality implies that if a design belonging to one of the series 
(6.1) or (6.12) is impossible, then the corresponding design of the other series is 
also impossible. Suppose that there exists a solution of a design of one of the 
series, but that the corresponding design of the other series is impossible. Then 
by Theorem 6.1, upon inverting the design whose solution is known, we obtain 
the solution of the corresponding design of the other series. But this contradicts 
our assumption. Consequently, the first statement of this paragraph is true. 

Actually, the series (6.1) and (6.12) are the same, since we may obtain (6.1) 
from (6.12) by interchanging r and k. However, the correspondence between 
designs of (6.1) and (6.12) gives a means of pairing off those designs which are 
duals of each other. 


7. Constructions and impossibility proofs. Putting k = 3 in the two-parameter 
series of designs (6.1), we obtain the single-parameter series, with r = 2 


2, 
“ae 
ore 

2(r 

2(r 


Putting k = 3 in the two-parameter series of designs (6.12), we obtain the 
single-parameter series 
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f 


v* = r(2r — 1), r* = 3, NI = 3(r — 1), 


b* = 3(2r—1), k*=r, N= 2 = 2r — 1)’, 


sas fers 2(r —1) - ( 3 Piet) 
" NS = 1) Smite - 9) \86—-2) 8-747) 


It follows from Section 6 that corresponding designs of (7.1) and (7.2) are 
duals of each other and exist or fail to exist simultaneously. Note that series 
(7.2) is identical with (4.32). It follows from (4.31) that designs of (7.2) cor- 
responding to values of r other than 2, 3, 5, and 11 do not exist, and the same 
therefore must hold for designs of (7.1). In what follows we shall show that 
designs of (7.1) with r = 6 are impossible, which rules out the case r = 11. 
We shall also give a construction for the case r = 5 for which the corresponding 
design for (7.2) can be obtained by dualization. The designs corresponding to 
r = 2 and 3 are known, and will be considered briefly at the end of this section. 

We write the r 6-blocks of (7.1) as 


Each group consists of 2(r — 1) blocks, the lettered treatments within a group 
being denoted by the same letter bearing the appropriate subscript. Each group 
contains two sets and each set contains r — 1 blocks, the blocks within a set 
containing the same lettered treatment (Corollary 5.1). Since n» = 4(r — 1), 
each group must contain exactly one complete replication of the numbered 
treatments (Corollary 5.2). Without loss of generality, we may write the first 
group containing lettered treatments a; and a; as 


a; 1 2 
ay 3 4 


a, 5 r — 1 blocks, 


r — 1 blocks. 
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There is no loss in generality in writing the })-set as 
by 2r— 1 
b é 2r+1 
by ¢ 2r+3 


by 2r — 3 4r —5 


where the };-set contains only the odd-numbered second associates of @ (Corol- 
laries 5.4 and 5.5). Then, the b.-set must contain all the even-numbered second 
associates of 6 (Corollary 5.2). The treatments 2, 4, 6, --- , 2r — 2 must occur 
one in each block of the be-set, the same being true for the r — 1 treatments 
2r,2r + 2,2r+4,---,4r —4. 

We shall now show that the pairing off of the even-numbered treatments in 
the b.-set is uniquely determined. Since no pair of numbered treatments ap- 
pearing in the a;- or a2-set can occur together in any block of the b-, c-, --- , and 
l-groups (Ai. = 1, A: = 0, and Lemma5.1), we form a lattice (Diagram 1) having 
horizontal coordinates 1, 2, 3, --- , 2r — 2 and vertical coordinates 27 — 1, 
2r,2r + 1, --- ,4r — 4. The coordinates of the cells of the lattice diagram give 
all conceivably possible number-pairs which might occur in the b-, c-, --- , and 


2r — 52r—42r—32r-—2 





























DraGramM 1 
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l-groups. The cells whose coordinates appear as a pair in some block of the 
design are marked with a P and the cells whose coordinates are ruled out by 
Lemma 5.1 or one of its corollaries are marked with a X. The cells corresponding 
to the number-pairs appearing in the b,-set are those lying in the upper left 
corner of the subsquares of side 2 lying along the main diagonal of the lattice 
diagram. 

The number-pairs whose coordinates are the numbers occurring with 7 in 
the a,- and ),-sets, where i = 1, 3, 5, --- , 2r — 3, are ruled out by Lemma 5.1. 
So are the number-pairs whose coordinates are the numbers occurring with j 
in the a2- and },-sets, where j = 2r — 1, 2r + 1, 2r + 3, --- , 4r — 5. The cells 
corresponding to these number-pairs lie in the upper right and lower left corners 
of the subsquares of side 2 lying along the main diagonal of the lattice diagram. 
Furthermore, Lemma 5.1 rules out the occurrence of the number-pairs indicated 
in Diagram 2. 

3 5 


2r— 5 2r —3 


(3, 2r — 1) 


(3, 2r + 3) 


(3, 4r — 7) 


(5, 2r — 1) 
(5, 2r + 1) 


(5, 4r — 7) 


(2r — 5, 2r — 1) 


(2r — 
(2r — 


5, 2r + 1) 
5, 2r + 3) 


(2r — 3, 2r — 1) 


(2r — 3, 2r + 1) 
(2r — 3, 2r + 3) 


(2r — 3, 4r — 7) 


(3, 4r — 5) oe 


(5,4r — 5) +--+ (2r 
DIAGRAM 2 


5, 4r — 5) 


The corresponding cells in the lattice (Diagram 1) are those lying in the odd 
numbered columns and odd numbered rows, with the exception of those lying 
along the main diagonal of the lattice diagram. In other words, they are all cells 
lying in the upper left corners of the subsquares of side 2, with the exception of 


those lying along the main diagonal of the lattice diagram. 

Treatments 2r — 1, 2r + 1, 2r + 3, --- , 4r — 5 must each occur in each of the 
remaining r — 2 groups (Corollary 5.2). The available number-pairs involving 
these treatments are given in Diagram 3. Since there are only r — 2 number- 


2 


4 


(2, 2r + 1) 
(2, 2r + 3) 


(2, 4r — 7) 
| (2, 4r — 5) 


pairs involving each of the treatments 2r — 1, 2r + 1, 2r + 3, --- 


(4, 2r — 1) 


(4, 2r + 3) 


(4, 4r — 7) 
(4, 4r — 5) 


6 


(6, 2r — 1) 
(6, 2r + 1) 


(6, 4r — 7) 
(6, 4r — 5) 


DIaGRaM 3 


2r — 4 


(2r 


4, 2r — 1) 
4, 2r + 1) 
4, 2r + 3) 


4, 4r — 5) 


2r — 2 


2, 2r — 1) 
2, 2r + 1) 
2, 2r + 3) 


2, 4r — 7) 


, and 4r — 5, 


each of the number-pairs in the Diagram 3 must occur in the design. 
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Comparing the available number-pairs of Diagram 3 with those in the a»-set, 
and using Lemma 5.1, rules out the number-pairs indicated in Diagram 4. Thus 


2 + 6 eee 2r — 4 2r — 2 





-<-- (4, 2r) (6, 2r) tee (2r — 4, 2r) (2r — 2, 2r) 
(2, 2r + 2) --- (6,2r+2) --- (27 —4,2r +2) (2r — 2, 2r + 2) 
(2,2r +4) (4, 2r + 4) --- see (2r — 4,2r + 4) (2r — 2, 2r + 4) 


(2,4r — 6) (4,4r—6) (6,4r—6) --- ese 
(2,4r — 4) (4,4r—4) (6,4r —4) --- (27 — 4, 4r — 4) 
D1aGRAM 4 


in the lattice (Diagram 1) we cross out the cells lying in the lower right corners 
of all subsquares of side 2, except for those lying along the main diagonal of the 
lattice diagram. 

Examination of the lattice diagram now shows that the only cells for which 
both coordinates are even numbers are those lying along the main diagonal, 
that is, cells whose coordinates are 


(2, 2r), (4,27 +2), ---, (@r— 2,4 — 4). 


Thus the pairing off of the numbered treatments appearing in the b.-set is 
uniquely determined. The r — 1 blocks of the be-set are: 


be 2 


be 4 
6 


be 


be 2r — 2 4r — 4, 


The occurrence of the above number-pairs in the bo-set rules out only number- 
pairs which have been previously excluded. 

Each of the treatments 2r, 2r + 2, 2r + 4, --- ,4r — 4 must occur in each of 
the remaining r — 2 groups (Corollary 5.2). The only available number-pairs 
involving these treatments are given in Diagram 5. Again, there are only r — 2 
pairs involving the treatments in question. Hence, each of the number-pairs in 


1 3 5 ane ar — 5 2r — 3 

2r --- (3, 2r) (5,2r) ++» (2r — 5, 2r) (2r — 3, 2r) 
2r +2) (1, 2r + 2) -~<- (5,2r +2) --- (27 —5,2r+2) (2r — 3, 2r + 2) 
ar+4 (1,2r+4) (3, 2r + 4) --- vee (27 — 5, 2r + 4) (2r — 3, 2r + 4) 


4r —6) (1, 4r — 6) (5,4r — 6) --- --- (2r — 3, 4r — 6) 
4r— 4. (1, 4r — 4) (5,4r — 4) +--+ (2r — 5, 4r — 4) --- 
DraGram 5 
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Diagram 5 must occur in the design. Furthermore, these number-pairs and those 
in Diagram 3 involving treatments 2r — 1, 2r + 1, 2r + 3, --- , 4r — 5 are the 
only available number-pairs. 

In each of the number-pair diagrams of available pairs, it is obvious that 
number-pairs lying in the same row or in the same column are not set com- 
patible. Furthermore, the occurrence in any set of two unsymmetrically located 
number-pairs with respect to the main diagonal of the number-pair diagram 
would rule out the occurrence of the number-pair lying at the interesection of the 
rows and columns containing the two number-pairs in question (Corollary 
5.5). Hence, only symmetrically located pairs with respect to the main diagonal 
of the number-pair diagram can possibly be set compatible. 

Now each set contains r — 1 blocks and consequently requires r — 1 number- 
pairs. But we have available a maximum of 4 possibly set-compatible pairs, two 
from each number-pair diagram. Hence, all designs of the series (7.1) having 
r = 6 are combinatorially impossible. 

We shall now give a construction for the design of (7.1) corresponding tor = 5. 
Its parameters are 


b = 45, k = 3, Ae = 0, m= 


ee oo we 


h = 27, r= 5, uM = 1, ny 10, 


(7.3) 


By the preceding argument it is seen that we may write, without loss of 
generality, the five 6-blocks, the eight mixed blocks of the a-group, the eight 
blocks of the b-group, and the distribution of the lettered treatments throughout 
the remaining mixed blocks as shown below. The placing of the underlined 
numbered treatments comes later. 


d-Group 





9 
ll | ll 
13 | 14 
15 | 16 


KO 1 iO IS 
| bh ba fe | 
I> RO KOKO | 
| 





10 2 13 
12 | es 15 
14 |e 10 
16 | 12 


DraGcram 6.—Plan for design (7.3) 


th iG ie 100 
— — 
be 1S Or 
— m1 Me Im 


| 
| 
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From Diagram 3 of available number-pairs it is seen that the only available 
number-pairs involving treatments 2, 4, 6, and 8 are those shown in Diagram 7. 


(2, 11) 
(2, 13) 
(2, 15) 


4 6 


(4, 9) 


(4, 13) 
(4, 15) 
D1aGram 7 


(6, 9) 


(6, 11) 


(6, 15) 


8 


(8, 9) 
(8, 11) 
(8, 13) 


From Diagram 5 of available number-pairs we also see that the only available 
number-pairs involving treatments 1, 3, 5, and 7 are those shown in Diagram 8. 


1 


(1, 12) 
(1, 14) 
(1, 16) 


3 5 


(3, 10) 
(3, 14) 
(3, 16) 


(5, 10) 
(5, 12) 


=~ 
‘ 


(7, 10) 
(7, 12) 
(7, 14) 


(5, 16) 
Dracram 8 


Since these are the only available number-pairs and since each of the numbered 
treatments must occur in each of the c-, d-, and e-groups, by Corollary 5.2 the 
design must contain all the number-pairs in these two arrays (Diagrams 7 
and 8). 

Now, by Corollary 5.2 the number-pairs (4, 9), (6, 9), and (8, 9) must be 
distributed one in each of the last three groups. Suppose we arbitrarily form 
the three blocks 


Ci 4 9g 
dy 6 9 
€i 8 9. 


Since each set must contain four number-pairs and since only symmetrically 
located pairs with respect to the main diagonal of the number-pair diagram are 
set compatible, we must also form the blocks: 


Ci 2 11 
dy 2 13 
e\ 2 15. 


The occurrence of the pairs (4, 9) and (2, 11) in the c;-set precludes, by Corol- 
lary 5.5, the occurrence in this set of pairs involving 1, 3, 10, and 12. Thus, the 
¢;-set must be completed with (5, 16) and (7, 14). Then by Corollary 5.2, we 
must complete the c:-set by use of pairs (8, 13), (6, 15), (3, 10) and (1, 12). 
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The occurrence of pairs (6, 9) and (2, 13) in the d,-set precludes, by Corollary 
5.5, the occurrence in the same set of any pairs involving 1, 4, 5, 8, 10, 11, 14, 
and 15. Thus, it is seen that the only pairs which can occur with (6, 9) and (2, 13) 
in the d,-set are (7, 12) and (3, 16). Then by Corollary 5.2 the d:-set must con- 
tain pairs (8, 11), (4, 15), (5, 10), and (1, 14). 

Since the e-set contains pairs (8, 9) and (2, 15), Corollary 5.5 rules out the 
occurrence in the e;-set of any pair involving 1, 7, 10, 16, 4, 6, 11, or 13. Thus, it is 
seen that only (5, 12) and (3, 14) are available. Then by Corollary 5.2 the pairs 
(6, 11), (4, 13), (7, 10), and (1, 16) must occur in the é:-set. This completes the 
construction of design (7.3). The conditions set forth in Lemma 5.1 and in 
Corollaries 5.1 through 5.5 are satisfied, and Theorems 3.1 and 3.2 are also 
satisfied. 

When r = 2, the design (7.1) is a lattice design obtained by arranging 9 
treatments in a 3 X 3 square and taking the rows and columns for blocks. The 
corresponding design of (7.2) is the dual of this and is a group divisible design 
[3] with 6 treatments divided into two groups, say 1, 3, 5 and 2, 4, 6. The 9 
blocks are obtained by taking all possible pairs consisting of one treatment from 
each group. 

When r = 3, both series (7.1) and (7.2) lead to the same symmetrical (self- 
dual) design with 15 treatments and 15 blocks. This is a triangular design with 
known solution [3]. Hence we may state the following theorem: 


THEOREM 7.1. The series of designs (7.1) contains only three combinatorially 
possible designs: 


(1) the lattice design withy = 9, r=2, andk = 3; 
(2) the symmetrical triangular design withy = b = 15, r=k=3, m = 6, 


nm =8, y= 1, A =); 

(3) the design with parameters (7.3) and plan following (7.3). 

Combining the results of this section with those of Sections 4, 5, and 6, we may 
state the following theorem: 

TuroreM 7.2. Partially balanced incomplete block designs with two associate 
classes for which k > r = 3, with \y = 1 and d, = 0, must belong to one of the 
following classes: 

(a) Designs obtained by dualizing balanced ineomplete block designs with k = 3 
and \ = 1. These designs belong to series (4.34). 

(b) Lattice designs with three replications belonging to series (4.33). 

(ec) The design of series (4.32) for which k = 5. This is the dual of the design 
with parameters (7.3). 
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0. Summary. Statistical problems involving angular observations may arise 
in diverse scientific fields, either from direct measurement of angles—say the 
direction of winds or of glacial pebbles or of fracture planes—or they may arise 
from the measurement of times reduced modulo some period and converted into 
angles—say time of day when train wrecks occur. Specifically, we consider a set 
of n points £, situated on a unit circle and assumed to constitute a sample from 
a distribution having the p.d.f. g(é), where 0 S & < 27. Let the n random unit 
vectors thus defined have the components sin £, and cos &, , and set 


(0.1) V=e=>Sicost, We dosing, R= SY? + W. 


Let P(r, n) be the probability that R < r, and let Q = 1 — P(r, n). 

This paper shows how the statistics V and R provide tests for the uniform 
distribution g(¢) = 1/2. The distribution of R on the hypothesis of uniformity 
was derived by Kluyver as a solution to Pearson’s random walk problem, and 
is tabulated here for use in significance tests. The distribution of V is derived 
here, but has not been calculated. 

To illustrate the type of tests that might employ the statistics R and V, 
consider a carnival wheel—first from the standpoint of the punter who suspects 
bias, second from the standpoint of the mechanic who has attempted to in- 
troduce bias. The punter, by studying the performance of the wheel, might wish 
to answer two questions: first, does the wheel differ credibly from an unbiased 
wheel? second, what is the direction and extent of the bias, if any? An answer 
to the first question is obtainable from the distribution of R, and an answer to 
the second has been provided by Mises. The mechanic, on the other hand—be- 
cause he knows the direction of the bias, if there is a bias—might better use the 
statistic V as a test of his success, and he might appropriately modify the Mises 
approach in estimating the extent of the bias. 











1. Pearson’s random walk. In 1905 Pearson [13] posed the following problem: 

“A man starts from a point O and walks a distance a in a straight line; he 
then turns through any angle whatever and walks a distance a in a second 
straight line. He repeats this process n times. 


Received April 6, 1954. 


1 This is a revised version of a paper presented at the Kingston meeting of the Institute 
of Mathematical Statistics in September 1953, under the title ‘““The Integral Solution of 
Pearson’s Random Walk Problem and Related Matters.’’ 
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“T require the probability that after these n stretches he is at a distance be- 
tween r and r + 6r from his starting point, 0.” 

Thus, Pearson was assuming a sample of n random angles £, from the uniform 
distribution g(€) = 1/27, and his required probability is the differential. 


(1.1) £ PCr, n) ér. 


After Pearson’s statement of the random walk problem, Rayleigh [16] ob- 
served an analogy to the theory of vibrations and deduced the asymptotic 
formula 


(1.2) Q(r, n) = exp[—r’/n]. 


Then Kluyver [7] obtained a solution, showing that the probability of R S r is 
(13) P(r,n) =r [ (Jo(a)PJu(rz) dz. 
J0 


This formal result can easily be found by bivariate characteristic functions, but 
the transformation from rectangular to polar coordinates is difficult to make 
rigorous. 

Both Kluyver and Pearson were loath to attempt direct quadrature of (1.3) 
or its p.d.f. (see [14], p. 5), and Pearson devoted his efforts toward developing an 
asymptotic approximation (Section 6). We, however, felt that Pearson’s series 
could best be checked by quadratures and that, with the advent of extended 
tables of the Bessel functions and improved calculating equipment, this was now 
feasible. Table 1 presents quadrature values of Kluyver’s integral. Table 2 
presents interpolated 5 percent and 1 percent points for r, and for several func- 
tions of r that may be useful in making significance tests. A discussion of the 
calculation procedure appears in Sections 4 and 5. 


2. The problem of Mises. In another problem involving the summation of 
random unit vectors, Mises [12] considered vectors distributed according to 
the p.d.f. 


exp{k cos(é — a)} 

£ SE 

(2.1) g(é) 2Qr1y(k) ’ 

of which the uniform distribution is a degenerate case (k = 0). He then showed 


that a joint maximum likelihood estimate of the concentration parameter k and 
the modal angle a@ is obtainable from the vector sum by means of the relations 


(2.2) R cos a = >> cost, = V, R sin a = >> sing, = W. 


Recently Gumbel, Greenwood, and Durand [6] christened (2.1) the “circular 
normal distribution,” tabulated the integral thereof, and calculated a table for 
converting R/n = a into k, the maximum likelihood estimate’ of k. 


2 According to Mises ((12], eq. 16), R/n = 1,(k)/Io(k), which is tabulated by Gumbel et 
al. ([6], p. 140). 
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TABLE 2 
5 and 1 percent points for r and derived functions 


5 percent — P(r, m) = .95 1 percent — P(r, s) = .99 


rt z= 1/n r r/n r2 


i. 2.8573 -951 8251 24. 
20. 2.8819 .394 .7705 29. 
23. 2.9014 797 7246 33. 
26. 2.9102 | -185 .6872 38. 
29. 2.9187 | 559 .6550 42. 


32. 2.9262 | 893 .6266 47. 
35. 2.9320 .220 .6017 52. 
38. 2.9370 |, 7.533 5795 56. 
41. 2.9413 .833 5595 61. 
44. 2.9450 ‘ 5415 


47. 2.9482 40: .5251 
a0. 2.9511 d .5101 
53. 2.9536 9 .4963 
56. 2.9558 9. .4836 


59. 2.9579 | 9. .4718 





62. 2.9597 ‘ -4608 
65. 2.9613 ; -4505 98 
68.1! 2.9629 : .4409 102. 
2.9642 
2.9957 .6052 





Accordingly, if the punter wishes a joint maximum likelihood estimate of the 
direction and extent of bias in the carnival wheel, he may easily obtain this if he 
is willing to assume that the wheel is biased according to the Mises distribution 
law. Moreover, if he wishes to test whether the observed performance of the 
wheel is consistent with the uniform hypothesis k = 0, he may do so by a simple 
extension of Mises’ argument. In fact, the likelihood ratio for testing k = 0 
against the alternative k = k and a = & is 


(2m) = tne 
(24 Io(k)\-" exp[kE cos (& — &)] exp [KR] 


From the fact that the second derivative of In J)(k) is the variance of the linear 
distribution of cos £, and therefore essentially positive, it can be shown that the 
above ratio, for fixed n, is a decreasing function of R alone, where R has the dis- 
tribution (1.3) on k = 0. The test of k = 0 then consists in comparing R or R° 
with the tabled values of r or r* (Table 2). An example will be given in Section 8. 

The above hypothesis k = k and a = a—the punter’s alternative—is but one 
of several against which uniformity might be tested. In fact, twelve possible 
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hypotheses can easily be formed by coupling one of the three a-hypotheses with 
one of the four k-hypotheses: 


0Sa<2r, a Sasa, a = a; 


k = k* > 0, (k* unknown), k=k, k2kh, kzkek. 


In particular, a = a and k = k* > 0 is the appropriate alternative for the 
mechanic who has attempted to bias the carnival wheel; the mechanic, unlike the 
punter, knows the position of the mode if there is a mode. And if he rejects uni- 
formity, the mechanic needs to estimate only the parameter k. 

The mechanic’s problem is of special interest because of its relation to the work 
of Mises, who formulated his “Kriterium der Ganzzahligkeit”’ ({12], p. 495 ff.) 
in hopes of showing that atomic weights are integers subject to error. The ob- 
served atomic weights reduced modulo 1 and transformed into angles are subject 
to a test of the uniformity hypothesis against the alternative a = 0 and k = 
k* > 0. In making this test, Mises proceeded to estimate k from the statistic 
R; but, on the alternative stated, V and not R furnishes a maximum likelihood 
estimate of k. 

The likelihood In L = —n In Io(k) + kV — n In 2. Setting 

d I,(k) 

aL = ~aae* ” % 
we find that V/n = I,(k)/Io(k), so that the recent table ([6], p. 140) applies. 
The distribution of V is derived from the characteristic function of cos £, 


Qr 
¢(z) = = | eft de = Jo(zx). 


Therefore, the sum of n independent cosines has the characteristic function 
[Jo(x)]” and the distribution function 


c+o[ tert — de = c+) | ote 2 ar. 
2a sw ix i | : 


When v = 0, the integral vanishes; therefore C = 4 and the probability that 
V < v takes the form 


(2:3) S++] (ole) 2% ae 

- “0 zx 
on the hypothesis k = 0. In the limit, V is normally distributed with mean zero 
and variance n/2. 


3. Nonzero k and the power function. Under some circumstances the me- 
chanic may feel sanguine that he has effectively biased the wheel and yet have 
doubts whether the direction of bias is exactly zero, as designed. Then, an 
entirely different sort of test would be needed—in fact one analogous to the 
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linear t-test for the difference between an observed mean and its hypothetical 
value. But the construction of this test—and several others that might be use- 
ful—requires investigation of the distribution of R or of & — a, either jointly 
or severally, on the hypothesis k = k, > 0. 

On this hypothesis the joint distribution of V and W (or of R and @), the dis- 
tribution of R, and the distribution of V can be obtained by an artifice due to 
Fisher’ [4], which derives these distributions painlessly from those on the hy- 
pothesis k = 0. Fisher seems to effect his derivation by means of the following 

Lemma. Let F(x, y) and G(x, y) be two distribution functions such that 


(3.1) dF(z,y) = Ae**” dG(z,y), A= I et 4G(x, y). 


Denote the distributions of the n-fold convolutions of F(x, y) and G(x, y) respec- 
tively by F,(v, w) and G,(v, w). Then 
(3.2) dF ,(v, w) = A"e”*™ dG,(v, w). 

To prove this let 


$(t, u) = I eet) aF(x, y), 


vit, u) an I gitet) dG(a, y), 
where the integrals are Stieltjes integrals extended over the x, y-plane. Then 


$"(t, u) oa I eitetew) dF, (v, w), 


v°tt,u) = ff et aay, w). 


By hypothesis (3.1) 
g(t, u) = I et Act” dG(zx, y) 


= Ay|(t — ia), (u — ib). 
The characteristic functions of the left and right sides of (3.2) are, respectively, 
o°(t, u) = A*y"[(t — ia), (u — 1b); 
[ff erm arer*™ aG,(o, w) = A*Y"I(t — ia), (w — i). 
The equality (3.2) follows from the equality of the characteristic functions. 
8 Fisher’s work is concerned with the spherical analogue of the Mises distribution (2.1). 


Although the trigonometry is more involved in three dimensions than in two, the algebra 
is much simpler. See Rayleigh [17], p. 338 ff. 
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On the hypothesis k = 0, the differential of (2.3) is 


dP = ot. [Jo(x)]” cos vx dz. 
TT 0 


Therefore, by (3.2) the differential of probability on the hypothesis k = k, and 
= Ois 


aP = sari 1 Yael a, 
so that 


e* 1—iz)v 


PV So) = sare | ela" ae 


This distribution is the power function of V sinalait the Mises alternative a = 
Oandk = k, > 0. 

To obtain the distributions of W and R, one first writes down the differential 
of the joint probability of V and W on the hypothesis k = 0. The probability 
density function corresponding to (1.3) is circularly symmetric and has the form 


4) SL Se" taete)lsilrn) de = 2 |” sala)Idolra) 2 a, 
so that 


aPiv ++, Ws = — | Volz) ]"Jolas/P® Fw) dr. 
0 


It will be convenient to use Pearson’s notation of ¢,(v? + w*) for the integral 
above. Then, on the hypothesis a = 0Oandk = ki > 0 


(3.5) dP(V sv, W S w) = (20) Uolki)"e"ba(0” + w’) dv dw. 


This differential does not readily yield the marginal distribution of W. Probably 
a convenient approximation is to substitute Pearson’s Laguerre function ex- 
pansion for ¢,(v” + w*) and integrate v. out (see Section 6). 

The distribution of R can be obtained by transforming (3.5) into polar co- 
ordinates, 


kyrcos8 ) 
. PV s ig etttiiite Sere ». 
(3.6) dP(V s rcos8 < rsin 8) TATA G r dr dg: 


whence dP(R S r) = [Io(ki)|""Jo(kir) ¢,(r”) r dr. Integration by parts then 
gives 


P(R sr) = Ua(kdT*| Ilka) PO, n) — [ ki I,(ky s) P(s, n) as], 


where P(r, n) is the function of (1.3). This last distribution is, in fact, the power 
function of R against the Mises alternative a = 0 and k = k, > 0. It is also the 
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power function against the Mises alternative a = a* (unknown) and k = k, > 0, 
because F is unaffected by increasing all the ¢, by an arbitrary constant. The 
distribution of & can be evaluated after expanding (3.6) in Laguerre functions. 


4. Calculation of Kluyver’s integral. The integral has been evaluated for 
nm = 6(1)24 andr = 0(.1)6(.5)12(1)n, and for such other values at intervals of 
.05 or .1 as were necessary to embrace the 5 percent and 1 percent points of R. 
Table 1, as published here, is an abridgment. Evaluation of the integral for 
n = 3, 4, or 5 would have required special treatment, such as: (A) tabulating 
J; to values much greater than the present upper tabled limit of 100; (B) re- 
placing J» and J, by their asymptotic series and then expanding the integral 
into a series of sine integrals for n = 3 or 5, or a series of Fresnel integrals for 

= 4; (C) an extension by quadratures as performed by Pearson and Blakeman 
({14], pp. 16-20), but carried on digitally instead of graphically. For n = 2 
the distribution of R is simply (2/7) arc sin r/2. 

The integration formula used was the integral of Bessel’s interpolation formula 
carried to 8 differences. For an interval of integration h = .04 was used forr < 
10, and h = .02 for r = 10; agreement at r = 10 was good. 

There were three chief sources of error in the integration: 

(A) The integration was truncated at rz = 100, the upper limit of the J; table. 
This error is larger than that explained in (B) following for n = 6, 7, and 8; for 
n = 4 or 5 it would have been prohibitive. For n = 7 arguments for J; above 
100 were supplied. 

(B) The integration was often truncated for |.Jo(x)|" less than 10~°. This was 
by no means systematic, however, since for many of the integrations additional 
arguments were conveniently included. 

(C) The remainder term, involving ninth differences and greater, was neg- 
lected. 

The values P(1, nm) = 1/(n + 1) and P(n, n) = 1, known by elementary 
analysis, furnish a partial check on the accuracy of the integration. In most of 
the table discrepancies were less than 10-°, and nowhere were they as great as 
10°. We believe that Table 1 is accurate to 5 decimals as given, with the ex- 
ception of n = 6, where an error of 1 in the fifth place can easily have occurred, 


5. Interpolation of percentage points. Because of discontinuities in its deriva- 
tives with respect to r, Kluyver’s integral (1.3) presents problems of interpola- 
tion, both direct and inverse. These discontinuities have been discussed by 
Rayleigh [17] who investigated the behavior of the leading term in the asymp- 
totic expansion of the Bessel function product. He found that the derivatives 
of order 4(n — 2) for n even and 4(n — 3) for n odd are discontinuous‘ at 
the points n — 2k, for k = 0, 1, 2, 3, etc. Although the integral 


[ * (Sola) "Iilrx) ae, 


‘ For a discussion of discontinuities in the sum of n rectangular variables, see Cramér 
[3], p. 245, and Fisher [4], p. 298. 
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where the upper limit of quadrature X is a large number, has continuous de- 
rivatives of all orders, these derivatives become large for the same points n — 2k. 

For the problem of inversely interpolating the 5 percent and 1 percent points 
(Table 2), the singularities of (1.3) were not particularly troublesome because 
none of the desired points happen to fall uncomfortably close to a singularity. 
Thus, for the interpolation of Q(r, 6) = .05, the function was evaluated for the 
five arguments 4.10, 4.15, 4.20, 4.25, and 4.30, which do not include the sin- 
gularity r = 4.00. Although singularities are less dangerous for large n—since the 
order of the first discontinuous derivative increases—they were avoided when- 
ever convenient, and in no case did an interpolation use points on both sides of 
a singularity. Table 2 presents interpolated 5 percent and 1 percent points for 
r, and for several functions of r that may be useful in making significance tests. 
The first three of these functions—namely, r, r/n, and r’°—should all be correct to 
four significant figures, as tabled. The function z, given to five significant figures 


for comparisons in Section 7, may possibly contain an error of about 2 in the 
last figure. 


6. Pearson’s asymptotic expansion. Pearson derived an asymptotic expansion 
for the probability density function (3.4) corresponding to (1.3). This expan- 
sion is, in effect, a series of Laguerre functions 


fr) =e" Sa c:Li(kr’), Lia) = e€ x ee -2 aes ar 


To derive the c;, Pearson essentially used [Jo(x)]” as a moment generating 
function, thus obtaining the expansion 


6.1) L [olay Jotrade da = + o* ¥ es Lite 
or, with z = r’/n, 
(6.2) 2ar dr = E [Jo(x)\"Jo(raz)a dx = dze* > c, L,{z). 


Each c; ({14], p. 9) is of order n”, where @ is the integral part of 3(1 + 7). Pearson 
derived the c, through cs , which contains terms in n~° 
From the well-known relation 


[ e L((z) dz = e*[L(z) — tL.-a(z)), 


the series on the right of (6.2) may be integrated term by term as 


af Q 
(6.3) Q(r,n) =e *sl + dX elLi(z) — iLeste)lh. 
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Rearranging (6.3) in descending powers of n and omitting contributions of 
order n“‘ and n~*, which appear in cs and cg, we get 


12n? 2} 3l 4! 
692° 1632 | 1452" | 


f 2 2 3 4 

Q(r,n) =e E +x(2 -Z)+ 5 (-z me ~) 
(6.4) - 
TET FER aR 
This series is more convenient for computation than (6.3). Table 3 compares 
for n = 7 and 14 the value of P(r, n) = 1 — Q(r, n) obtained by quadratures 
with Rayleigh’s approximation P(r, n) = 1 — e *, Pearson’s approximation 
(6.3) through cs, and the approximation (6.4) through n™. It appears that 
either (6.3) or (6.4) gives roughly three-decimal accuracy for n = 7, and that, 
if anything, (6.4) is a little better than (6.3). Although three reliable decimal 
places provide a fair degree of accuracy over most of the curve, they are clearly 
insufficient for estimating percentage points in the extreme tail. 


1 42° 
in Eh ee OR nie 
+ sal eae 


7. Series expansion of the percentage points. Having obtained Q(r, n) as 
an expansion in powers of z and n-!, we now attempt to obtain an expansion 
for z in terms of Q(r, n) and n-. Since the Rayleigh approximation above ob- 
viously leads to z = —InQ(r, n), it is convenient to set 


(7.1) y = —ln Q(r, n) 
and expand z in powers of y. To do this, set 
(7.2) z=y+ > an. 
Then e’Q(r, n) = exp >. an‘, whence we obtain 
am =3(2y—y°*), a2 = Ay(12y — 3y* — y/’), 
Qs = xhs(—12y + 42y* — 8y* — y/). 


For n = 7, 14, and 21, Table 4 gives the successive approximations to the 1 
percent and 5 percent points and compares them with the values obtained by 
inverse interpolation. Even when four terms (through a;) are used, good ac- 
curacy is not achieved for n = 7. Three terms give fair three-place accuracy 
for n = 14 and four-place accuracy for n = 21, so that three terms suffice to 
extend Table 2 beyond n = 24. However, three terms may be inadequate to 
obtain points in the extreme tail, such as the 0.1 percent point, if these are 
desired. For example, Q(12, 21) = .000,648,77 by quadratures. The three-term 
approximation for z is 6.8590 against the correct value 144/21 = 6.8571. The 
fourth term is —.0015, and the four-term approximation is therefore 6.8575. 


8. An example of the R-test. Forrest [5] gives 651 observed times of break- 
down due to lightning in the British electrical grid system for the eight years 
1940-47. The observations fall into two groups: 588 for the summer months 
(April-September) and 63 for the winter months (October—March). The summer 





TABLE 3 
Various calculations of Kluyver’s iniegral for 
n = 7and 14 


P(r,n) =r [ [Jo(x)]” Ji(rx) dx 


a | 
P(r, ») by quadrature | (Rayleigh) aay em - (6.4) through a7? 


-03508 ’ 
.13312 : . 12500 


27489 ‘ - 26078 
-43528 ‘ -41819 


59052 ' | 57497 
72355 .71339 
82623 | , | .82289 
89830 ‘f .90082 


.94458 .95046 
.97188 | .97846 
.98672 .99219 
.99416 .99785 


-99761 


83105 
-88548 
-92570 


95397 
-97285 
-98481 
-99196 


99601 
99815 
. 99921 
. 99969 


99997 
1.00000 
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TABLE 4 
Various calculations of the 5 and 1 percent points for 
n = 7, 14, and 21 


5 percent points ‘ peo points 
(r, m) = .95 (r, n) = .99 


y +2.9957 3227 +4.6051 7019 
ay —0.7457 3683 —2.9993 1302 
a2 —0.2480 4699 —1.4725 7372 
a; +0.1574 8945 —1.3736 8651 
n= i 
y + a,/7 
y + a:/7 + a2/49 
y + a:/7 + a2/49 + a;/343 
z, by interpolation 
n= 14 
y + a,/14 
y + a:/14 + a2/196 
y + a:/14 + a2/196 + a;/2744 
z, by interpolation 
n = 2) 


8892 1767 
8841 . 1466 
8846 1426 
8819 1561 


bt bo bt 


bo 


.9425 .3909 
-9412 3834 
9413 3829 
-9413 .3829 


tS bb bo 


bo 


y + a,/21 

y + a:/21 + a2/441 

y + a;/21 + a2/441 + a2/9261 
z, by interpolation 


.9602 .4623 
.9597 .4590 
.9597 .4589 
.9597 .4589 





bo bo bh be 


breakdown times have a rough resemblance to a Mises distribution with a 
conspicuous mode at about 1700 hours, though the fit is not first-rate, owing 
to an apparent secondary mode in the early morning. The winter breakdown 
times are given in the table below, and it is by no means certain that this dis- 
tribution differs significantly from the uniform. 

We propose to test the hypothesis k = 0 against the alternative k = k* > 0 
and 0 S a < 2z. To this end, we reduce the midpoints of the hourly time inter- 
vals to angles (0-1: 7°.5, ete.) and include the corresponding sines and cosines 
in Forrest’s table. 


G.M.T. Freq. Sines Cosines ’ Sines Cosines 
0-1 . 1305 .9914 . 1305 .9914 
1-2 .3827 .9239 . 3827 .9239 
2-3 .6088 .6088 . 7934 
3-4 .7934 . 7934 .6088 
4-5 .9239 -9239 . 3827 
5-6 .9914 .9914 . 1305 
6-7 .9914 .9914 . 1305 
7-8 .9239 .9239 .3827 
8-9 .7934 .7934 .6088 
9-10 .6088 .6088 . 7934 

10-11 -3827 . 3827 .9239 
11-12 . 1305 .9914 


KF onowrNK WOH ee 
noe fF. eB NHN SH Oe 
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From this layout, it is a simple matter to calculate the statistics 
W = —13.3393, V = —3.2652, R° = 188.59. 


Since the corresponding percentage points for r° have not been tabulated for 
n > 24, it is necessary to calculate an approximate value for z after the method 
of Section 7. From the two-term formula (7.2), specifically y + a,/63, the 5 
percent approximation 2.9835 is obtained, which compares with R’/n = 2.993. 


9. Unsolved problems. We have indicated in Section 3 how the distribution of 
& — a may be obtained for known k. To test the significance of an observed 
discrepancy with no a priori information about k is an obvious extension of the 
problem in linear statistics solved by the ¢-test. For large n one can replace k 
by & and treat it as known; for small n the appropriate test is unknown. 

On the assumption of discrete sample points measured without error, (1.3) 
is a suitable mathematical model. Forrest’s data, to which we applied (1.3), 
are sparse data grouped at the midpoints of class intervals. We do not know the 
effect of such grouping on V, W, or R, or on any estimators derived from them. 

Although we have shown that FR provides a likelihood ratio test for k = 0 
against the composite Mises alternative 0 S a < 2x, and k = k* > O, and 
that V provides a likelihood ratio test against the Mises alternative a = a, 
and k = k* > O, we feel that these results are by no means conclusive because 
of the existence of other unimodal, symmetrical p.d.f.s. For example, Levy 
[9], [10], Mareinkiewicz [11], Perrin ({15], p. 20), Wintner [18], [19], and Zernike 
({20], pp. 477-478) have discussed wrapped-up normal and Cauchy distributions 
—that is, the angle £ is assumed to have the normal or Cauchy distribution but 
the angles &£, &£ + 27, & + 4m, ---, are indistinguishable. Arnold ([1], p. 6) 
has shown that V and W jointly provide consistent estimates of the parameters 
of this distribution. Brooks and Carruthers (({2], p. 199) have translated a bivari- 
ate normal distribution into polar coordinates with the origin removed from the 
center of the distribution and have integrated out with respect to the radius. 

Arnold and Krumbein have indicated problems wherein angles are identifiable 
only modulo +. A workable approach is to use the distribution Ce“ and the 
statistics 


V2 = >> cos 2,, W. = > sin 2¢,, R, = VV3 + Wi. 
This solution is not unique, and the merits of alternative solutions should be 
studied. 

Spherical distribution problems have received some attention, both as gen- 
eralizations of circular problems and on their own merits. A few examples might 
be mentioned. Arnold [1] has extended the Mises distribution and the wrapped-up 
normal to the surface of the sphere. Rayleigh [17] has shown that the spherical 
analogue of R has for distribution a set of elementary functions abutting with 
discontinuous derivatives as in Section 5, and Fisher has given an explicit formula 
for this distribution. The spherical analogue of the V statistic is distributed as 
the sum of n rectangular variables (Fisher [4], p. 296). 
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ON THE EFFICIENCY OF EXPERIMENTAL DESIGNS! 


By Sytvain EHRENFELD 
Columbia University 


1. Introduction. Many models used in statistical investigations can be formu- 
lated in terms of least-square theory. The models that will be discussed can be 
stated as follows. Let y, , --- , yw be N independently and normally distributed 
random variables with common variance o*. It is assumed that the expected 
value of y, is given by 


(1.1) E(ya) = Bitar + Bok a2 + a + BeLap » a> i, ae N, 


where quantities x,.; fori = 1,--- ,p;a@ = 1,--- , N are known constants and 
8,,°:** , 8p are unknown constants. The coefficients 8, , --- , 8, are the popula- 
tion regression coefficients of y on 2, 2, ++: , X» respectively. In matrix nota- 
tion the above model can be expressed as 


(1.2) E(y) = XB 


Zu *** Ly 


By Im*** 2 YN 


The p column vectors in X will be denoted by 2, 2%,---, x, where 
x; = (xj, %2;,°°* , tw;). In some cases the experimenter has some amount of 
freedom in the choice of the p vectors x; . The efficiency and sensitivity of the 
design may be very much affected by the choice of the design matrix X. The 
choice of this matrix is equivalent to that of p vectors in N-dimensional Euclid- 
ean space. 

A simple illustration is furnished by the following example. Suppose y. are 
independent random variables with equal variance o , where E(y.) = 
Bi(ta — £) + B62. The z’s are assumed to be fixed constants. Suppose, further- 
more, that we have N pairs of observations (x1 , yi), --- , (tw , yw) and we want to 
estimate 8; . It is known that the variance of the least square estimate of 6; is 
inversely proportional to )>. (ta — )*. Hence, if we could choose values 
Zi, *** , vy in a domain 7, we would choose them such that 24 (tq — #) isas 
large as possible. 

In Section 2 we will prove a theorem about quadratic forms which, with the 
aid of other considerations, will motivate a criterion for the efficiency of a de- 
sign matrix X. In Section 3 two theorems will be proved to aid in the applica- 
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tion of this criterion. In Section 4 applications will be given, and it will further- 
more be shown that the Latin square is most efficient in a certain sense. 


2. Measure of efficiency of design matrix. 

2A. A useful inequality. Suppose we have a real p x p symmetric non-nega- 
tive matrix S of rank r < p. Let ¢ be a column vector with p components, not 
all zero, such that the equation Sp = ¢ has solutions for p. Then we have the 
following. 

THEOREM 2.1. Jf p is any solution of Sp = t, then the following inequality holds: 


1 p’ Sp 1 
: S-. 6 
@ 1) Amax i. ok Amin : 


where \max ANd Amin are the maximum and the minimum of the nonzero character- 
istic roots of S. 

Proor. There exists an orthogonal matrix O such that O’SO = Q = ()38;;), 
and therefore p’Sp = > Asp. . The Ax, --- , A, are the nonzero characteristic roots 
of S, and p* = O’p. Since Sp = t, we also have that ¢/t = os Net’ since t’t = 
p*0' S'00’ SOp* = p*’2'p*. Thus if we let z; = ~/\,p* , we have 


(22) p’'Sp _ Duirs pr Doi zi 
2 ae 


and since 


(2.3) eae gt 
Annes 2 ri 2 Amin 


we get equation (2.1). When S is of full rank, equation (2.1) becomes 


ae 1 St % 
(2.4) hh ae or 

For the most part we will restrict ourselves to the case where S has full rank, 
In that case S has an inverse, and p = St. The case where S is not of full 
rank can be treated in a way very similar to the case of full rank. Some of the 
problems that arise in connection with the model which was considered in 
Section 1 will be discussed. 

2B. Estimation. Suppose S = X’X is of full rank, and we want to estimate 
6, where @ = >-? 1,8; = ¢’8 and the ¢; are given constants. 

From least-square theory it is known that the best unbiased linear estimate 
of 6, in the sense of minimum variance, is 6 = > tsb; where the 8; are the least- 
square estimates of 8;. Also, it is known that the variance of 6 is ot'S't. If 
t is such that Sp = ¢ has solutions for p, we say that @ = ?’8 is estimable. Only 
estimable @ are here considered. When S is not of full rank, the variance of 6 is 
op’ Sp where p is any solution of Sp = t. 
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2C. Power. We will now derive an expression that will be used in Section 4. 
Suppose S and S~ are partitioned into 


es 
(4 B ; 
= S$", 
B’ ) 


where a and A are k x k matrices. If we wanted to test 8; = Bp = --- = & = 


0, then the usual F test has a power function depending monotonically on a 
parameter ® where 


(2.7) o® = B’A~'B, B’ = (Bi, = +e Bx). 
Since A~’ = a — c’d™'c, we have 
(2.8) ob = pap — (cB)'d'(cB). 


2D. Confidence interval. If >> t; = 0, then @ is called a contrast. A confidence 
interval for 6 with confidence coefficient 1 — a is 


(2.9) 6-—k oe 5056+k. 65, 
where k, is an appropriate constant. Here 64 is defined by 


(2.10) 65 = FS t, 


where é° is the usual estimate of o°. If we let L equal the length of the con- 
fidence interval, we have 


(2.11) (E(L)Y = Mo't'S“'t, 


where M is independent of o’. 


2E. Application of inequality. A simple application of Theorem 2.1 to some 
of the previous expressions gives 


o ‘ cv. 
(2.12) x, vt S Var (6) = 


F Mc ‘ . . a 
(2.18) aadt" < (A(L)) “i Amin 


t’t, 


t’t. 


If we wanted to test 6, = --- = 8, = 0, we would get 


(2.14) ou FS ppt zee =. 
o o o 
It is to be noted that all these bounds can be attained. If (2.12), (2.13), and 
2.14) are considered, it can be seen that it would be desirable to make Amin aS 
large as possible. With this in mind, we define the following as a criterion for 
the efficiency of a design. 
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DeFtniTion. Denote by u the maximum value of \,,jn for x.; in T. The ratio 
Of Amin/u will be called the efficiency of the design. The design will be called 
most efficient if the efficiency of the design is equal to one. 


3. Extension. In order to apply this criterion of efficiency, it will be helpful 
to have the following two theorems. 

THEOREM 3.1. If 2:2; = C; where c; = O are fixed constants, then X.»in will be a 
maximum when x;«; = 0 for i # j. 

Proor. Let c be the smallest of c:, --- , cp and designate 


c; —A 0 


S(\) = S —_ XI, S(A) = 


0 CG —A 
and let |S(A)| = |S — XJ| be the determinant of S(A). We have that |S(A)| = 

(ce; — ). Since S(O) and S(0) are symmetric and non-negative, we also have 
|S(0)| = O and |S(0)| = 0. Let us now consider the matrix S(c). Suppose first 
that S(c) is non-negative. We then have 0 S |S(c)| Ss |S(c)| = 0, and thus c 
would be a root of S(O) and thus Anin S c. Suppose, however, that S(c) were 
not non-negative. Then there exists a root } < 0 such that |S(c) — XJ| = 0, 
or equivalently |S(0) — cI — XJ| = |S(O) — (c + X)I| = 0. Thusc + Lisa 
root of S(0). Since \ S 0, we have ce + \ S c which gives the desired result. 

When x;2; = Ofori ¥ j, 2; is called orthogonal toz;. When the p vectors 
are mutually orthogonal, then c, which is the upper bound of \,j, can be at- 
tained. In some situations one can maintain the condition of orthogonality and 
increase c at the same time. This theorem is usually useful only if the experi- 
menter has some freedom of choice in all vectors x, , 72, +++ , Zp. 

In many situations, however, the vectors 2, --- , z, are fixed, and there is 
only freedom of choice of z,4:1, ++: , 2». In those cases the following theorem 
is sometimes useful. Its proof is very similar to the one used for Theorem 3.1. 

Suppose z;, ++: , 2, are fixed vectors. Then S can be represented as a par- 
titioned matrix as follows: 


Alb 
(3.1) S = S(b, D) = (+= 

b’ | D 
where S(b, D) indicates that S depends on b and D. Also we have 
A = (2,23), tijj=l,-°-,7, 
(3.2) ’ 

D = (2;2x;), iij=rt+i,---,p. 

S(b, D), A, and D are symmetric and non-negative matrices. Let the charac- 
teristic roots of A be 0 S A1 S --- S X,, and thoseof DbeO S&S & 


- S 6,.,. The 2’s are fixed, but the 6’s depend on the choice of 2,41, --- 
Xp. 


bs 
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TueoreM 3.2. If p is the smallest characteristic root of S = S(b, D), then 


(3.3) ¥ SA. 
when b = 0 and Mi s 6,, then y = M4. 


Proor. There exist orthogonal matrices T, Q such that 
A 0 6; 
(3.4) T'AT = oboues QDQ=| — 
A, 


If we let 


a 
as pu a 
09 Q 


then P’SP has the same characteristic roots as S, since P is an orthogonal 
matrix. We also get 


Since b* is zero if and only if b is, we have by use of Theorem 3.1 that 
W S min (A; --: , Ar, 01, «+ , @p-+) S&S 1 Which proves the theorem. 

In terms of the vectors 2,41, --* , Z» Theorem 3.2 means that to maximize 
Amin We must choose 2,2; = Ofori = 1,-:-,r;j =r+1,---, p. In order 
to attain the upper bound ); , we must also make Ai S 4. 

To do this we might use Theorem 3.1, and make z,z, = 0 for t, s = r + 1, 
++», pand 2,2, = fort = r+ 1,--- , pif possible. 


4. Applications. 
4A. Hotelling’s weighing problem. Consider the weighing of N linear com- 
bination of p objects on a chemical balance. This gives rise to the following 
equations: 
(4.1) E(ya) = Lai1 + ee + Lapp 5 
or 
. Zip 
E(y) = Xa, X = 
Tn os INnp 
where w; is the weight of the ith object. 
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[+1 if ith object is placed in left pan at ath weighing, 


(4.3) tei = {—1 if it is placed in right pan, 
0 if it is placed in neither. 


Thus the vectors x; , 22, --* , 2» are vectors whose components are +1, —1, or 
0. From Theorem 3.1, it follows that the most efficient design for this problem 
would be gotten by making z;2; = 0 for i ¥ j and to make z,z; as large as pos- 
sible. This can be done by picking 4; = +1 or xa; = —1, then 2,2, = N. The 
vectors 2, --- , Z» cannot be chosen as indicated for all N. The problem of 
when and how this can be done has been thoroughly discussed in the literature 
3}. 

4B. Power. From equation (2.8) it can be seen that mings<; ¢ @ would be a 
maximum if c = 0 and the minimum root of a is large. The condition c = 0 is 
equivalent to making 2,2; = Ofori = 1,---,k;j7 =k + 1,---,p, and from 
Theorem 3.1 the Amin of a can be made large by making z;2; fori = 1, ---,k 
as large as possible and 2,2, = 0 fors ¥ t; s, t = 1, --- , k. Any linear hypothesis 
can be put into canonical form so that it becomes 8; = 6. = --- = & = 0. 

4C. Analysis of covariance. The model usually used for a two-way layout can 
be stated as follows. 


(4.4) E(yis) =e + Q: + T; + B(2,; m~ Z), = 


where the Q’s and 7',’s are row and column affects respectively. Associate with 
each pair (7, 7) an integer a such that a(t, 7) = (¢ — 1)s + j fora = 1 
rs. Then (4.4) can be written as 


(4.5) E(ya) =u + > tiaQi + > aT’; + B(2a 7 £) 


where 


, , 


( +1 if ath plot is in ith row and 0 otherwise, 
4.6) 
Vja = +1 if ath plot is in jth column and 0 otherwise. 


In matrix notation the above becomes 


E(y) _ X 6 _ (1 hie loa = bre Via °** Vea (te _ €)) 


It can be seen that 21, --- , rat, Tr42,°°* , Tree41 are fixed vectors in the 
mode]. The only choice available to the experimenter is in the last vector 2,4,.+2.. 
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An application of Theorem 3.2 gives us the following advice for getting a good 
design. Make 2, .427,4.42 a8 large as possible, where 


(4.8) Tr 4048r4042 - bi (a as #) - Di. (Xi; _ #)’. 


Furthermore, it is desirable to make x,,,,9t; = Oforj = 1,---,r+s+1 
simultaneously, that is; 
(4.9) Doe lia(ta — F) = Do; (xi; — %) = 0, 
(4.10) Doavjalta — Z) = D: (2,; — Z) = 0. 
Equations (4.9) and (4.10) thus give conditions for an efficient design for this 
model. 

4D. Latin square. Suppose we wish to find out by experimentation whether 
there is any significant difference among yields of m different varieties 1, , --- , 
vm . An experimental area is divided into m’ plots lying in m rows and m columns 
and each plot is assigned to one of the varieties 1, --- , vm. Denote by yi 
the yield of variety »% on plot in 7th row and jth column. It is assumed that the 
yi jx are independently and normally distributed with common variance o’. 


(4.11) E(yin) = ki + dj + pe 


where k;, d;, px are the row, column, and variety effects respectively. The 
quantities o, k; , d; , p, are unknown parameters. The hypothesis to be tested 
is that variety has no effect on yield, i.e., 


(4.12) aS ee 


Wald proved that the Latin square was most efficient in a sense which he de- 
fined in [1]. We will prove a similar result in this section. 
THEorEM 4.1. The Latin square is most efficient in the sense of our definition. 
Proor. As was done in (4B) we let a(z, 7) = (¢ — 1)m +) fori,j = 1,---, 
m. Let tia, Uja » Zea for i, j,k = 1, --- ,m;a@ = 1, --- , m’ be defined as follows. 


tia = +1 if ath plot in ith row and 0 otherwise, 
(4.13) = +1 if ath plot in jth column and 0 otherwise, 
Zea = +1 if kth variety is assigned to ath plot. 


Then (4.11) can be rewritten as follows. 


(4.14) E(ye) = - Kitie + } djja + 7 PrZka « 
ra k=l 


j=l 


We want to express the hypothesis in (4.12) in canonical form. In order to do 
this denote the arithmetic means 


(4.15) i = i > ben's i; = i + a Bias 2; = - @ Zia 


m= m?* “a m? “a 
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and then let te = tia — bi, Usa = UWia — Wi, Zia = Zia — 2; and k; = 


km, @; = di — dm, p: = pi — pm fori = 1,---, m — 1, and let w, 
a = 1,---,m’. Then (4.14) can be written as 


m—1 


m—1 m—1 
(4.16) E(ya) = § Wa + » kitia + » djtia + dX pi2ia; 


where — = re kits + ae dja; + =a pz: + km + dm + pm, and the 
hypothesis in canonical form is 


, 


(4.17) . coe mt pny = QO, 
In matrix notation the design matrix X becomes 
(4.18) XxX = (1 tie ‘. P ae lied Ute or Un—1,0 Zia 7 Cad 


The experimenter has freedom only in the choice of the z;.,% = 1, «:- 
m —1;a@ = 1,--:, m’. The z;. depend on the way the varieties 1, --- , vm are 
assigned to the m’ plots. In the Latin square arrangement each variety ap- 


pears exactly once in each row and exactly once in each column. The S matrix 
for the above model is 








and we let D = (z;z;) and 
2! 
m | 
ot —— 


| 
im— 1 


It is known that for the Latin square 


(4.21) + Zatlja = » ta Zratia -™ Bae Cala = 0, ty Js k=1,-++,m—1, 
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which is exactly the condition of orthogonality needed in Theorem 3.2, namely 
b = O. For a Latin square the D matrix becomes 


m-—l1 —1 


(4.22) 


and if we use the fact that 


a 


= a "la — kl, 


Ql = 


A ee a—1\| 


where Q is a k x k matrix, it is thus seen that \,; S @,, which proves the theo- 
rem. 
Acknowledgement. I would like to thank Professor Henry Scheffé for many 
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ON THE APPROXIMATION OF A DISTRIBUTION 
FUNCTION BY AN EMPIRIC DISTRIBUTION’ 


By Jerome BLACKMAN 
Syracuse University 
1. Summary. Let x;,---, 2, be independent chance variables with the 
common distribution function F(x) and the empiric distribution function F*(z). 


Let a, be the value of a which minimizes (1) below. In this paper the asymp- 
totic distribution of +/n a, is obtained, subject to certain restrictions on F(z). 


2. Introduction. Let x, , x2, --- , x, be nm independent random variables hav- 
ing the common distribution F(x). Suppose b; , bo, --- , 6, are the n random 
variables in ascending order of values and F*(x) is the empiric distribution 
function, continuous on the right, with jumps of magnitude 1/n at the points 
b; , bo , --+ , b, . Define the function H(a) by 


(1) H(a) = [ [F(a — a) — F*(x))? dF(x — a). 


Thus H(a) is non-negative for all a and since it is a Borel-measurable function 
of the random variables {2,}, it is also a random variable. The value of a which 
minimizes H(a) will also be a random variable. If the minimizing value of a 
is a, , we shall be concerned with the limiting distribution of a, as n — «. Our 
main result is the following. 

TueroreM 2. If the first three derivatives of F(x) are continuous and bounded, 
then 


; 1 uz/y ee 
lim P(/na, < x) = sal evr dy, 
n—-20 at Jax 


where 


ol ey ey =z 2 
| F(R) at — | at | wr) at| ay, 
Jo | Jo Jo x Jo | 


0 


x 


mM [ \F’(a)|* dF (x), F(t) = sup {x | F(x) = ¢}. 


We shall henceforth assume the conditions of Theorem 2 are satisfied. In 
what follows repeated use will be made of an important result due to Kolmo- 
goroff [2}. 

THEOREM. Suppose that F(x) is continuous, and define the random variable D,, 
by D, = l1.u.b.z |F(x) — F*(x)}. 


Received June 17, 1954. 


1 Work on this paper was done under sponsorship of the Office of Naval Research. The 
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Then for every fixed z => 0, asn—> ©, P(D, < zn“”) — L(z), where L(z) 


is the cumulative distribution function which for z > 0 is given by either of the 
equivalent relations 
( ) 


L(z) = 1 — 22) (-1)"" exp(—2y*2*) = ve 2 exp )— oo eas. 


pool 82? 


v=al 


For z <= 0 we have, of course, L(z) = 0. 
The Kolmogoroff theorem implies that F*(x) — F(x) uniformly in prob- 
ability and that therefore a, — 0 in probability. Expanding the right side of 


(1), 


H(a) = / {F(a — a) — F*(x)}* dF(a — a) 
= [ F’(x — a) dF(x — a) — 2 / F(a — a) F*(x) dF (x — a) 
+ / F* (x) dF(x — a) 


o He [ F(z — a) dF*(z) — [ F(a — a) dF**(z). 


‘Therefore 


( 2 2 \ 
(3) H’(a) = 2 F(x — a) F’(a — a) dF* + [ F'(z — a) aV*(z)>. 


Each a, must satisfy the equation H’(a) = 0. It will be seen below that all 
solutions of H’(a) = 0 which converge in probability to zero as n — ~ will 
have the same limiting distribution. Putting H(a,) = 0 we obtain 


Qn" Do F(b; — an) F"(bi — an) = wv * Df? — (i — 1)*} FO, — a), 
where summation is from 1 to n, or 
(4) > {2nF(b; — an) — 21 + 1}F’(b; — an) = 0. 
Since F(x) has a continuous third derivative, 
(F(b; — an) = F(b;) — F’(bs)an + 4F"(b; — 0a,) a , 0 
F’(b; — an) = F’(b;) — F"(bi)an + 3F'"(b; — ya,)a,, 0 
Placing (5) in (4) and dividing by n’ results in 
0 = {—2n">-F(b) Fb) — n> (—2i + 1)F'(b,)} 
(6) + {2n">SF(b,)F"(b:) + 2n" DOF? (b,)) + nD (—2; + IF" (bi) Jan 
+ T,(an)a;, 


where P{|7T(a,)| S C} 2 1 — e¢ for some C and forn > N(e) by the assump- 


(5) 





258 JEROME BLACKMAN 


tion that the derivatives are bounded and that a, — 0 in probability. Equa- 
tion (6) is of the form 


(7) A, + Bian + Ta(a,)a, = 0. 

Let a, be the solution of 

(8) A, + Bra, = 0. 

Subtracting (8) from (7) gives that, with probability 21 — « for n > N(e), 


- 


| Pa(ds)| dn < Can 
\B,| ~ |B, 
It will be shown below that B, — u ¥ 0 in probability, so that 


(9) la, — a,| = 


lan — a»| / \a,| > 0 in probability. 

It is therefore necessary only to find the limiting distribution of a, where 

—A,_ sh DOF) —i/n+ 1/2) Fo) 
B, = n* > [F(bi) — i/n + 1/2n) F“(b) +n SY F*(b,) 

4 ag n” 2 [F(bi) — i/n\F’(bi)) + in > F’(b,) Ee are 
n >. [F(b;) — i/n) F’(b) +n" + F’(b;)) + in Z. F’(b;) | 
3. Lemmas. Three lemmas are useful in the proof of the main result, Theo- 

rem 2. 

Lemma 1. For every « > 0 


lim P( |n~™? F’'(b)[F(6,) — i/n| — wv v2 >> F'(F(i/n) )[F(b)) — i/n]| > © 


no 


, 
An 


(10) 


ProorF. Since 
n'? n>” |F'(b:) — F’'(F'(i/n))|  |F(b) — i/n| } 
< Lu.b.; |F’(b;)) — F’(F™'(i/n))| Lub. |F(O) — t/n| nr, 
we need, by the Kolmogoroff theorem, show only that 
l.u.b., |F’(b,) — F’(F“(t/n))| — 0 in probability. 


Suppose « > 0 and 7 > 0 given. Let A be the linear set such that F’(x) > 
n/4 for x ¢ A and let A’ be the complement of A. Let D = (6; , F~(i/n)) be the 
open interval with end points b; and F~'(i/n), and M = 1.u.b., |F”(x)|. By the 
Kolmogoroff Theorem there is an N such that forn > N 


(11) P(iu.b.; |F(b;) — i/n| = 1°/16M) < «. 


F~l\(i/n) 
Lub. | F(b,) — i/n| 2| | | F(z) dz 


/ F’(z) ao = 4ymeas(DNM A). 
DnA 


On the set of sample points for which 


Lu.b.; |F(b,) — i/n| < 1°/16M 
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we therefore have 
meas [Df A] < »/4M forall i <n. 


Either meas (D) < »/4M, in which case |F’(b;) — F’(F~'(i/n))| < (n/4M)M < 
n, or there are points 7; and 6; for all i < n such that y;, 4; ¢ A’, 


lys — b:| < 9/4M, = [8 — F*(i/n)| < 9/4M. 
Therefore, in the second case 
\F’(bi) — F'(F(i/n))| 
< |F’(b;)) — F'(yi)| + |F’ (ys) — F'(6,)| + |F’(6) — FF (i/n))| 
M \bi — yi| + 20/4 + M 8; — F“(i/n)| 
n/4 + /2 + 0/4 = ». 


Therefore this inequality is true in any case except for the set of equations 
(11). Combining our results we have for all n > N 


P(l.u.b.; |F’(b;) — F’(F“(i/n))| > 0) < 


which proves the lemma. 
Lemma 2. Let Ho(x) = F’(F‘(x)) for 0 S x S 1 and 


H,(x) = tf H,,(t) dt. 


Then both 


a1 
(a) | a 'H,(x) dx < «, 
Jo 


(b) +» (—1)"*"'H, (a2) = =| t Ho(t) dt, uniformly in zx. 
k=l “ /0 


Proor. Let M = 1.u.b.. |F”(x)|. Consider the curve y = F’(z) at the point 
(x, F’(x)). If from this point a line of slope M is drawn to the z-axis, the line 
must be completely on or below the curve y = F’(x) in (— ©, x), by the mean- 
value theorem. Since F(x) = f*.. F’(x) dz, it follows that 


(13) F(x) = 3F"(x)/M. 


This implies that lim,.o, Ho(z) = 0 and by an obvious induction that 
lim, .o, H,(x) = 0. 
Differentiating (12) gives H,(a) = —x'H,(x) + 2 ‘Hy-1(z), or 


(14) [ aH.) de = [2 Hye) de — Holy) 
d /0 


Therefore the truth of (a) for n — 1 implies it for n and we need show 
fox 'Ho(x) dx = J®.. (F’(x)/F(x)) dF < @. Define the set E by 


E = {x | F’(x)/F(z) > F-**(z)}. 





260 JEROME BLACKMAN 


If E’ is the complementary set, 
1 
as) f w@sr@yar s [ re @ars [ 2 az = 2. 
gE’ Rg’ “0 


The set E consists of a union of open intervals, by the continuity of F and F’. 
Let [a, b] be one such interval. On [a, b] 


b b 
F(x = F'? (2), / F'(x)/F"? (x) dx = / dx 


or 2(F'?(b) — F'?(a)] = b — a. This implies that the measure of EF is 
Using the inequality (13) 


(6) f ('@)/F(@)) aF = | (P*(@)/P@)) de s 2M | ae 5 4M. 


This completes the proof of (a). Turning to (b) we note that by (14) 

[x 'H,(x) dx is a decreasing positive function of n for each y and therefore 

has a limit as n — «. Taking this limit in (14) shows that lim,.., H,(y) = 0 

for all y. Writing 
1 


/ x 'H,(x) dx = [ 
0 


Jo 


” 1 
x H,(x) dx + | x H,(x) dx 
” 


ra 


1 
s [ 2° He) de t+ [ x HA) ae 
~“0 n 


we see that 7 can be chosen so as to make the first of these arbitrarily small, 
and that the second then goes to zero by bounded convergence. Therefore 
lim,.« J, 2 H,(x) dx = 0. From (14) we obtain 


DT Ailz) = [ en dt — [ tH, (8) dt. 


Since | fot “H,(t) dt| < fo tH, (t) dt — 0 it follows that 2a H,(x) converges 
uniformly to f7 t'Ho(t) dt. Let J,(z) = So? (—1)*"'H;,(x). Then J,,(x) > J(z) 
uniformly as n — , and 


J, = S(-1)"' Ai (2) 
= «a [Ho — Hy — Ai + Hz --- (—1)"" Han + (-1)"Ap] 
= 2 Hy — 24 ‘Jn1+2 (—1)A,. 


The right side converges uniformly for e S z S$ 1 so that forO0 < zx S 1, 


J’ + 22°) = 2*Hd(2z), oJ’ + QJ = xH)(z), d/dx(2’J) = zHh(z), 


or J(x) = x” Jo tHo(t) dt, proving (b) and thus Lemma 2. 
Lemma 3. For every « > 0 ands = 0, 1, 2, --- 
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° ( i ¥ c. 
lim P "(b;) —-) - H.{2) d 
[Ya be (6){r00 9) - va [m0 
(F(b) -F®:+1it1, 1 1 ‘) i | 
we) = ae n rm "(h:.) — — on : 
4 = + *} + We >» Hon(? \F (b;) af > ‘| 0 


F (bi41) 
Proor. In what follows we take F(bj4:) 
es k 


1 whenever it occurs. Letting 
G, = >-'_1 H.(i/n) and using the Abel transformation 


Vat Cl 6) - 3} 


I pet 
eye eas 1} 4 Te Gal Pb) = 1) 


Vn : r wee {Feb) — Fu) + 1\ 


1 ¥ f/f \ 
n—l ~ijn ( 
=vad | He) de F(b) — F(bis) i+1 4 


a. eae +3} 


+vad{ a-f * Hsla) a} {F0) — Febos) +} i} 


= F(b) — F (bi+1) . 
- V/n : cr H,(x) a PF (b. } \F (diss) i n J 
(16) 


i+1\ 


1 
+ Vn {F(b,) — 1} 
= V/n>, > | “HuGe) de {POI = Fen) § +} 


Mae ae, 2} 
ijn ‘ 
HVvn sf) G.— [| H@) az} {Fb — Fas) + +} 
F(b) — F(a) i+] 
+Vabd {fr H,(2) aa} eee Fb) m ‘+t 
n i/n 
+Var [| Hula) dz (FO) — Pu} 


ee ve Sh 1\ 
Conk t+1 a} Pw n } 





JEROME BLACKMAN 
. [ H.(x) dx {F(b,) — {1 —7* } 


i [ Hie) ax{ ro) - 0 *+441\4 Z-ro) - 11 6, 


“0 


= a A; . 
1 

We first must show that A, , Ay, As, A¢, and A; converge to zero in proba- 
bility. That this is so for As, Ag, and A; is a simple consequence of the fact 
that since F(b,) represents the maximum of n uniformly distributed inde- 
pendent random variables, the distribution function of 1 — F(b,) is1 — (1 — 2)” 
for 0 < x < 1, from which it follows that n'"{1 — F(b,)] converges to zero in 
probability for 7 > 0. Of course, n-'G, is bounded. 

Using the Abel transformation 


(i+1)/n 
>on Sif, H,(z) dx — - A, (S++) 


nt 
i+1_ 1 
? nf 
~(n—l)/n 


( "4 e 
+ Vins = Gas 7 | H,(x) ax} < F(b,) _ F(b,, ) incon a = 


én vs et tee tel os OE (: ms 2) Lub. Fy 


i Sei, ait | 


7. n 


+ OVn4 \1 — F(b,)| + . + rb, }. 


The last term clearly goes to zero in probability since n‘ "F(b:) converges to 
zero in probability for » > 0. (The distribution function of F(b:) is 1 — (1 — 2)” 
for 0 < x S 1.) By the Kolmogoroff theorem, given « > 0, we may choose A 
and N, such that forn > N; 


P(L.u.b.; |F(b,)) — i/n| /n > A) < 


Since H,(x) is uniformly continuous, we may choose Nz so that for n > Ns 


lu.b. max |H,(x) — H, (: < ') <= 


7 i i+1 A 
< — 
<7s— 


n 


For n > max (N:, N2) the probability is less than ¢ that the first term on the 
right of (17) exceeds e. 


\A = Va lup.lr() — 2) x " Hla) de} {Fban) — FOO) pep Tj 


F (bins) — F(d,) 


le.) — of 
< Vn lub.'F(b)) — = max H,(z) Le ~ FQisa) 
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| i|* 1 dx 
CVn. b. |F(b, at a 
A Vn . ( ) N| SF(b;) & 


| ; |2 
CVn L.u.b. (bi) _ ~ [In F'(b;)|. 


By the usual argument |A,| — 0 in probability. We now return to A;. 


ao SS [ne de Flbe) » } 


n itl n 


s 
1 n 


= Set aulSihras 24 


ae n—l n (i+1)/n f : i+ 1) 
tbr “* i Hila) dz) Flbsu) — | 


e 


e i/n 
+e - 1 l H, (zx) a{ FO) — F(bix:) + } {Fb ae . 4 


3 —™ ys) as-*' + [Om az {F() — © 

+ ene h s(x) tah s(x) dx a | 
n i/n ° 

+ Vado [ H) def PO) — Feu) +1} {roux — £4 
1 27+ 1 n n | 


5 
= >) B;. 
1 


To complete the proof of Lemma 3 it is sufficient to show that |B,;| — 0 in 
probability for i = 2, 3, 4, 5. The simplest arguments suffice for 7 = 2, 3, 4. 
In order to treat Bs note the identity 


2>> (ai — aigs)aign = a} — Oka: — Do (ai — iy)’, 
where sums are from 1 to K. If we set a; = F(b;) — i/n, we obtain 
[25 {F(b,) — F(biss) + 1/n} {F(b)) — i/n}| 
< |F(b:) — 1/nf’ + \F(bea1) — (K + 1)/n|? + DO {Fs — F(bis) + 1/n}? 
< 21ub.; |F) — i/n? + ¥ {FO) — F(biss)}? + 3/n. 


From the joint distribution of the quantities {F(b;) — F(bi,:)} given in [3] it 
is simple to show that 


P{F(bis1) — F(bi) = h} = [1 — A)”, 
Therefore 


P { lub. |F(bus) — F(b)| < h} = 1 — PU? |F(bus) — Fd! 


l<i<¢n 


n 


>1-dti—avA”® = 1- 


1 
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From this it follows that if —1 < a < 0 


lim P{ Lub. |F(bis) — F(b)| < n*} = 1, 


5 
n—+00 lsisn 


so that, choosing a = —7/8, there exists an N such that, for n > N, with 
probability greater than 1 — e 


\>> {F(b) — F(biss) + 1/n} {F(big1) — i/n}| 


S lu.b.; |F(b) — i/n)? + n~** + 3/2n. 
Now applying the Abel transformation to B; , 


| E ; | 
IBs| < Vn Ee F(b) —*| + 4 + | 
i n ne! n 


n—1 | aif 


(i+1)/n 
dal 


n 
aah sla ( 
6 FT x ; | H,(x) dx Zz 5 | H,(x) dx 


+ Va|lub.|roo -! T+ oatel|s[ H,(x) dz. 


The second term on the right clearly converges to zero in probability. We 
observe that 


x | H,(t) dt = | t"[H.(t) — Hoss(t)] dt 
“0 


“0 


and that by Lemma 2 the right side converges absolutely. Therefore 
= (i+1)/n 


th Hia)de-—™~ [ Hz) ae 


ijn 


e(i+l)/n 1 


—(H. — Haysl de — | —(H. — Heyl de 


“0 


elf 3 

n—1l (i+1)/n 1 
Le 2 [ tH, — Howl del < | 

1 i/n ( 


(\H.| + |Hessl} dx = Cy. 


0 


(i+l)/n 
n 
;[ ede 


(i+1)/n 


H, dx 


¥ n (i+1)/n H, d r (i+2)/n H Q 
+ nme / x / s 
i t+ ifn + 73 2 Ji4ty/n T 
n—1 
( 1+ 2 max H,{x) : =(+ C2 In nN. 
Oszsl 


With this estimate the first term in (19) 


converges to zero in probability and 
Lemma 3 is completely proved. 
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4. Proof of main result. In statistics the random variable F(b;,;) — F(b,) 
are known as coverages and the random variables 


{[P(bi41) — F(bs))/Fins)}, t= 0,1, 2,--- , 2, 


are independent (as usual F(bo) = 0 and F(b,+1) = 1) with density i(1 — u)** du. 
From this one calculates easily that the independent random variables 
X.. ~ {Fe — Fbmi+ 1, Il 


a 3 a 


have mean 0, variance n~*{i/(2 + 7)], and E( |X;,.\° ) S 16n™*. In what follows 
it will be necessary to apply the Central Limit Theorem to sums of the form 
S, = 2 tat 4; nX;,.. Although the Liapunoff version of the Central Limit 
Theorem is usually stated for sums of the form (> #?)~“” >> Y;, in this slightly 
more general form the proof [1] goes through with no change at all. It will be 
easy to show that the Liapunoff condition, 


> lainl? E(\Xi,n\°) 

(20 tim 2 lal BUXial) 9, 
nme Ds [dial EXt a)” 
holds. Let 


a > H,(i/n) | Fa - il, Win = Vand [ Hila) de Xen, 


i=] 


pij/n 


/n 4 H(x) dx — | Pe S |, tH (t) at x ime 


t==1 


Theorem 1 shows that we can replace n~“? >>? F’(b,[F(b,) — i/n] by Va 
which is the sum of independent random variables. 
THEOREM 1. For every « > 0 


im P| WS F’(b) IF(b) — i/n] - Vn. > | = 0), 
ne 1 

Proor. By Lemma 1 it is sufficient to show that 
(21) lim P[{|Yon — Val > «] = 0. 


This probability is 
Pl|Yo.n ria Vn) | > el 


M M 
pile (—1)*{ Yin — Wen + Visin} + dX (—1)*Win — Van 
+ (—1)"* Y utive > ‘| 
M 
> : PUlYin — Win + Visin| > €/3M] 
k=O 


M 
+P] Zz (—1)* Win = V, < “| + PUY 41.2! > € 3). 
k=0 
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The sum in the second term can be evaluated as 


M 
po (—1)* Win ae Ve 
k=O 


= Vat [ E (-v maG) ae [ae ae + [% [eto ath 
: 0 k=0 0 “0 x 40 


t=1 


= Vaid e Pp (—1)* H(z) dz + 5f tH(t) at| az} Xin 


By Lemma 2 the integrals appearing in the sum may be made uniformly less 
than +/¢/27 for all M = M,. The variance of >> (—1)"Wi.. — V,. is then 
<<'/27 for all n. 

By the Tchebycheff inequality 


M 

(23) P| > (-0' Win — Val >$] 5 §, all n, all M = Mi; 
k=0 i 

(24) \Yarsin| S Wn Lub.; |F(bs) — i/n| max, H y4:(z). 


Since lim,... H.(x) = 0 uniformly, we can, by the Kolmogoroff theorem, find 
an M, and N; such that for M = Mz andn > N, 


(25) P(\Y¥ wii.n| > €/3) S €/3. 
Now fix an M, say M = max (M,, M;). Lemma 3 states that 


lim P(l|Yi.n ed Win + Vi+1.2! > e] = 0 


n--2 


for all k and all « > 0. Therefore for N > Nz 
(26) > Pll¥e.n — Win t+ Yesin| > €/3M) S &/3. 


Combining (22), (23), (25), and (26) we obtain P[|Yo,, V,| > «] S e for all 
n > max (N,, N2), which proves Theorem 1. 

It is now easy to prove Theorem 2, our main result, as stated at the outset. 
The variance of V,, which is the sum of independent random variables, is 
asymptotically for large n 


n i/n i/n z 2 
1>T [ H(t) dt — [ 2 [ t H(t) at] 
n 40 /0 x Jo 


i=1 
=[\f H(t) dt — ($f tHo(t) at| dy = 9°. 


It is easily verified that the Liapunoff condition (20) is satisfied, so that by the 
Central Limit Theorem 


1 zliy 
lim P(V, < z) = val e lt at, 
n—»20 aT J—x 





APPROXIMATION OF A DISTRIBUTION FUNCTION 


By Theorem 1, 


oneal l< 1 Ten 
(27) lim P (= Prd| Fe) -*] < r) -vel. € — 


Returning to (10), 


n 


1g 
lim 52 > F'(b;) = lim =, => F"(b) = +x | Fe) = ‘| F’(b) +0 
1 


ne 1 no <1 
in probability. 


By the weak law of large numbers, n*>? F’ (b;) converges in probability to 
[*.. F” (x) dF(x) = uw. Combining these with (9), (10), and (27) gives our final 
result, 


(u/ vz 
—¢2/2 


lim P(/n an < 2) = Te nr 


n--o 


The constants v and uw cannot be zero. This is obvious for yu. If »v were zero, it 
would be necessary to have f} Ho(t) dt = fix dxf¢ tHo(t) dt. Differentiating 
this twice leads to the equation Ho(t) = c/t, which is impossible. 
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THE EXTREMA OF THE EXPECTED VALUE OF A FUNCTION 
OF INDEPENDENT RANDOM VARIABLES! 


By Wassity HoEFFpDING 
University of North Carolina 
Summary. The problem is considered of determining the least upper (or 
greatest lower) bound for the expected value EK(X,, --- , X,) of a given func- 
tion K of n random variables X, , --- , X, under the assumption that X,,--- , 
X, are independent and each X; has given range and satisfies k conditions of 
the form Eg$”(X;) = ¢;; fori = 1, --- , k. It is shown that under general con- 


ditions we need consider only discrete random variables X; which take on at 
most k + 1 values. 


1. Introduction. Let € be the class of n-dimensional dfs (distribution func- 
tions) F(x) = F(x,,-+-- , 2.) which satisfy the conditions 


(1.1) F(a, +--+ , 2n) = Fi(ai)F2(x2) --- F,(x,), 


(1.2) [ 0?) dF; (x) = ci;, 


0 ifzx<A; 


(1.3) F (x) = 4 
Ll ifz> B;, 
where the functions g;’ (x) and the constants c,;, A4;, and B; are given. We 
allow that A; = —« and/or B; = o. Here and in what follows, when the 
domain of integration is not indicated, the integral extends over the entire 
range of the variables involved. It will be understood that all dfs are continuous 
on the right. 

Let K(x) be a function such that 


$(F) = / K(x) dF(x) 


exists and is finite for all F in ©. The problem is to determine the least upper 
and the greatest lower bound of ¢(F) when F is in @. It will be sufficient to 
consider only the least upper bound. 

Special cases of statistical interest include K(x) = 0 or 1 cone as a 
function f(x) does or does not exceed a given constant; K(x) = max (%, +--+ ,2n), 
etc.; g(x) = 2’ (given moments up to order k); g§”(x) = 1 or 0 acide as 
Received November 3, 1953, revised November 30, 1954. 
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x <b; orz 2 b; (given quantiles) ; etc. For 


lifz St, 


K(x) = 0 otherwise 


1, gf (z)=2', K(x) = 


the problem was stated and its solution found by Tchebycheff. This result was 
extended to more general function K(x) by Markov, Possé, and others. References 
and proofs are given by Shohat and Tamarkin [9]. An extension to the case 
gi’ (z) = 2 and A, = 0 was considered by Wald [10]. Recent contributions 
are due to Karlin and Shapley [7] and Royden [8}. 

For n arbitrary,k = 1, gi? = 2, A; = 0, B; = «, and K(x) = lorO 
according as ).} 2; = tor <t, Birnbaum, Raymond, and Zuckerman [3] showed 
that when looking for the least upper bound of ¢(F) we need consider only dfs 
F; which are step-functions with at most two steps. They gave an explicit 
solution for n = 2. For the case k = 3, g§?(x) = x for i = 1, 2, and 
gs? (x) = |x —- a5l°, with ¢(F) the distribution function of the sum a x;, the 
inequalities of Berry [2], Esseen [4], and Bergstrém [1] give bounds which 
are asymptotically best as n — © but can presumably be improved for finite n. 

In the present paper it is shown that if, in the general problem as stated 
above, we restrict ourselves to the subclass C* of €C where the F ; are step-functions 
with a finite number of steps, then we need consider only step-functions with at 
most k + 1 steps (Theorem 2.1); the number k + 1 can in general not be 
reduced. The same result holds in the unrestricted problem if (A) each F in @ 
can be approximated (in a certain sense) by a step-function in @*, and (B) 
¢(F) is, in a sense, a continuous function of F (Theorem 2.2). Sufficient con- 
ditions for the fulfillment of assumptions (A) and (B) are given in Sections 3 
and 4. 


2. The main theorems. Denote by C* the class of all F(x) = Fi(x) --+ Fn(#,) 
in @ such that F,,---, F, are step-functions with a finite number of steps. 
Let @,, be the subclass of €* in which each F; for 7 = 1, --- , n is a step-func- 
tion with at most m steps. 

THEOREM 2.1. 


sup ¢@(F) = sup ¢(F). 
FeC* PeCgs1 
Proor. Let F(x) = Fi(x;) --- F(t.) be an arbitrary df in @* such that for 
some j, F; has more than k + 1 steps. It is sufficient to show that there exists 
a df G in C,4: such that ¢(G) = $(F). This, in turn, will easily follow when we 
show that if F,,(2) has m > k + 1 steps, there exists a df H,(x) such that 
a) H,(2x) has less than m steps; 
b) H(x) = Fila) +++ Pa_i(tn-1)Hn(xn) is in C; 
c) o(H) = o(F). 
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By assumption F,,(z) is of the form 
° if xr<a; 
tlle ifa, S x < Qr41, 


F,(x) = 


1 if Om S 2; 


A,Sam<a< +--+ <anSB,; 
pr > 9, 
Jip: + gieP2 + +++ + GimPm = Cin, 
Con = 1; gor = 1, 


foe = gs" (ar), t=1,--- 


(0 if s<&% 
im on + 4) + +++ +b (pe + td,) 
nt) = 
| ifa, S Zz < Ga; r= 


1 if a, S x. 


In order to satisfy condition b) it is sufficient to choose ¢, and d,, - 
such a way that 


(2.1) p, + td, = 0, 
(2.2) Gud, + Jide + haha + Jimd m — 0, 
We can write ¢(H) — ¢(F) = t>-", K.d, where 


n—1 
K,; = ce °°*, Dnel, re) aii Fy(aa. 
j=l 
Let A = 0 or 1 according as the rank of the matrix 


Jor, *** 5 Jom 


is less than or equal to k + 2. Then the equations (2.2) and >01 Kid, = d 
have a solution (d,, --- , dm) ¥ (0, --- , 0). Since >-7'd, = 0, at least one 
component of the solution d;, --- , dm must be negative. Having thus fixed 
d,, -+- , dm, we choose ¢ as the largest number which satisfies the inequalities 


(2.1). This number exists and is positive. Conditions a), b) and c) are now 
satisfied, and the proof is complete. 
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Let the distance d(F, G) between two dfs F(x) = F,(2;) --- Fa(z,) and 

G(x) = Gi(2:) --- Gn(2,) be defined by 
d(F,G) = max sup |F;(x) — G;(z)|. 
lsign —w<z<o 

We shall make the following assumptions. 

AssuMPTION A. For every F in € and every 6 > 0 there exists an F* in C* such 
that d(F, F*) < 6. 

Assumption B. For every F in @ and every « > 0 there exists 6 > 0 such 
that for any G in € which satisfies d(F, G@) < 5 we have |¢(F) — ¢(@)| < «. 

Lemma 2.1. If Assumptions A and B are satisfied, 


sup ¢(F) = sup ¢(F). 
Fee Fec* 


The proof is obvious. An analogous theorem clearly holds with an arbitrary 
distance function. 

From Theorem 2.1 and Lemma 2.1 we obtain 

THEOREM 2.2. If Assumptions A and B are satisfied, 

sup ¢(F) = sup ¢(F). 
Fee FeCesi 

In Sections 3 and 4 it will be shown that Assumptions A and B are satisfied 
for certain classes C and functions K which are of interest in statistics. 

We conclude this section by showing that Theorem 2.1 cannot be improved 
without imposing additional restrictions. More precisely, for every k and every 
n there exist functions K, gS” and constants A; , B; , c;; such that the conditions 
of the theorem are satisfied and supe ¢(F) > supe,, o(F) if m < k + 1. Further- 
more, the functions g{” and the constants A;, B; , c:; can be chosen to be inde- 
pendent of j. Let 


g\? (x) = x‘, C3 = C;, ; ; j = l, meen, 
where A and B are finite. 


First assume n = 1. We can choose the constants c; and k + 1 distinct real 
numbers a; , de, *** , @4: in [A, B] in such a way that the equations 


Qip: + asp, + -++ + GhaiPear = Cs, +=0,1,---,k, 


where co = 1, have a unique solution (pi, --- , Pe41) with p, > 0 for all r (see, 
for example, [9]). Then the df Fo, which assigns probability p, to the point a, 
for all r, is in C.4:. Let Ki(x) be a nonnegative continuous function, and let 
Q(x) be a polynomial of degree k such that Ki(x) S Q(x) for A S x S B and 
equality holds if and only if x = a, forr = 1, 2, --- ,k + 1. Let¢gi(F) = 
J Ki(x) dF (x). Then for every F in @, 


o(F) = | Q@) aF@) = o(Fd, 


and equality holds only for F = Fo. 
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Now suppose that supe, ¢:(F) = ¢:(Fo) for some m < k + 1. Then there 
exists a sequence {F“} of cdfs in @,, such that lim ¢:(F) = ¢:(Fo). Since A 
and B are finite, there exists, by the Bolzano-Weierstrass theorem, a subsequence 
{F“?} of {F®} which converges to a function F* in @,, at all points of continuity 
of F*. Since K,(x) is continuous, this implies lim ¢,(F“”) = ¢:(F*) = ¢:(Fo). 
This is a contradiction since F* ¥ Fy. 

For n arbitrary let K(x, +++ , 2n) = Ki(ai)Ki(ae) --- Ki(a,), so that ¢(F) = 
oi(F:) --- oi(F,). If the conditions for the case n = 1 are satisfied, we arrive 
at the desired conclusion, making use of the condition K,(z) 2 0. 

The assumptions that K, is continuous and A, B are finite are inessential, at 
least for k = 2. This is seen from the fact that the bound of t.e Bienaymé- 
Tchebycheff inequality can not in general be arbitrarily closely approached 
when the distribution is a two-step-function. 


3. Approximation of a df by a step-function. It will now be shown that As- 
sumption A is satisfied if © is the class of distributions of the product type 
(1.1) with prescribed moments and ranges. 

THEOREM 3.1. Assumption A ts satisfied if © is the class defined by (1.1) to (1.3) 
with g\?(x) = x™*’, where the m;; are arbitrary positive integers. 

The theorem is an immediate consequence of 

Lemma 3.1. Let F(x) be a df on the real line such that 


(3.1) 


(3.2) 


where we may have A = —« or B = o~. Then for every 6 > O there exists a cdf 
F*(x) which is a step-function with a finite number of steps, satisfies conditions 
(3.1) and (3.2), and for which 


sup |F*(z) — F(x)| < 6 


To prove Lemma 3.1 we shall need 
Lemma 3.2. If F(x) is any df which satisfies conditions (3.1) and (3.2), there 


exists a df which is a step-function with a finite number of steps and satisfies the 
same conditions. 


The statement of Lemma 3.2 is well known. For example, it follows from 
Shohat and Tamarkin ([9], Theorems 1.2 and 1.3 and Lemma 3.1). 
Proor oF Lemma 3.1. Given 6 > 0, we can choose a finite set of points 


A = @& < @ < G2 < +++ < dm < Gmui = B 
such that p, = F(a,., — 0) — F(a,) < d6forr = 0,1, --- , m. If p, ¥ 0, let 
(0 if a <a, 
P(e) = \Pr lFte) — F(a,)] if a, 52 < ay, 


\] if 41 S 2. 
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By Lemma 3.2 there exists a df F*(x) which is a step-function with a finite 
number of steps and such that 


/ x‘ dF*(2) = [ x‘ dF,.(2), 


F*(2) (0 ife<a,, 
SeRUtbive eS apg. 


0 if x<A; 
F*(x) = (F(a,) + p,F? (x) ifa, S 2 < G41, r= 0,1, --- ,m; 
1 if zB; 
where p,F}(x) = 0 if p, = 0. It can be verified that F*(x) has the properties 


stated in Lemma 3.1. 


4. Continuity of ¢(/). In this section we consider sufficient conditions for 
the continuity of ¢(/) in the sense of Assumption B. The next theorem shows 
that Assumption B is satisfied if ¢(/) is the probability that the random vector 
X with df F is contained in a set S of a fairly general type. 


THEOREM 4.1. Assumption B is satisfied if K(x) = 1 or 0 according as x does 
or does not belong to a Borel set S such that every set 


Sj(t1,°°* ,%j-1,2jar, °°, %n) = (x; :xe8S,x, fixedforkh #7}, j= 1,---,7n, 


is the union of a finite and bounded number of intervals. 
(Here {x: C} denotes the set of all points x which satisfy condition C.) 
Proor. Let 6 be an arbitrary positive number. Let F(x) = Fi(2) --- Fa(xn) 
and G(x) = Gi(x:) --- G,(x,) be any two dfs in € such that 
sup |F,(x) — Gi(x)| < 4, 


Let 


f n 
F® (x) = iu Fs} {I Gad} ; j = 90,1,---,n, 


nj +1 


so that F(x) = G(x) and F‘” (x) = F(x). Then 


o(F) — ¢@) = > (or) — o(F*)), 


We may assume that every set S; is the union of at most N nonoverlapping 
intervals, where N is a fixed number. For a fixed integer j and fixed values 
2, , With h # j, denote these intervals by J,, J:, --- , Ju, where M S N. We 
have 


o(F) — oF”) 


- [te 9 °° * gXj-1,2j41,°°° pita) d{ Fy(21) a F 5-1 ( aja) G ji (x54) a G,(2.)}, 
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where L(x, “9° Bger Beas, to Za) = > a { fr, AF ;(x;) , f:,, dG (x;)}. 


Since 
J 


we get |o(F) — ¢(G)| < 2nMé < 2nNé, so that Assumption B is satisfied. 

Assumption B of Theorem 2.2 is evidently satisfied under more general 
conditions than those of Theorem 4.1. Thus if K(x) = d,K,(x) + --- + dgKp(x), 
where d,, --- , dg are arbitrary constants and ¢,(F) = f K,(x) dF(x) is con- 
tinuous in the sense of Assumption B for r = 1, --- , R, then¢(F) = f K(x) dF(x) 
is also continuous. The same is true if K(x) can be approximated by a function 
of the form Bs d,K,(x), uniformly in the range of the distributions in @. 

Using Theorems 3.1 and 4.1 we can state the following corollary of Theorem 
2.2. 

THEOREM 4.2. Let @ be the class of dfs F(x) = F,(x:) --- Fn(x,) such that 


dF ;(x;) -| dG (xj) | < 28, 
Im 


F(A; = 0) = 0, FB; + 0) = a [oe dF (zx) = Cij, 
: 


where the numbers A; , B; , c:; and the integers m;; are given. Let S be a Borel set 


such that every set |x; :x ¢ S, x» fixed for h # j} is the union of a finite and bounded 
number of intervals. Then 


sup {| dF = sup [ ar. 
Fee YS FeCxi1 8 

5. Concluding remark. The problem considered in this paper can be modi- 
fied by admitting only those dfs in € for which the marginal distributions F, , 
--+ , F, are identical. Some results for this case were obtained in [6]. With this 
restriction Theorem 2.1 is no longer true, and the assumptions are no longer 
sufficient in order to reduce the class of competing dfs to step-functions with a 
bounded number of steps. 

For example, consider the problem of the least upper bound for the expected 
value of the largest of n independent, identically distributed random variables 
with given mean and variance. Hartley and David [5] showed that under the 
additional assumption that the df is continuous the least upper bound is at- 
tained with a continuous df when n = 2. At least for n = 2 it can be shown 
[6] that the Hartley-David bound cannot be arbitrarily closely approached 
with a discrete df having a bounded number of steps. 

On the other hand, if the assumption that the random variables are identi- 
cally distributed is dropped, Theorem 2.2 implies that the least upper bound 
is attained or approached with step-functions having at most three steps. 
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ESTIMATES OF BOUNDED RELATIVE ERROR IN PARTICLE 
COUNTING! 


By M. A. Grrsaicx, H. Ruspin, anp R. SITGREAVES 
Stanford University 


Summary. A statistical problem arising in many fields of activity requires the 
estimation of the average number of events occurring per unit of a continuous 
variable, such as area or time. The underlying distribution of events is assumed 
to be Poisson; the constant to be estimated is the unknown parameter \ of the 
distribution. 

A sampling procedure is proposed in which the continuous variable is observed 
until a fixed number M of events occurs. Such a procedure enables us to form 
an estimate /, which with confidence coefficient a does not differ from \ by more 
than 100 y per cent of \. The values of y and a depend on M but not on X. 

Modifications of this procedure which are sequential in nature and have pos- 
sible operational advantages are also described. 

These procedures are discussed in terms of a chemical problem of particle 
counting. It is clear, however, that they are generally applicable whenever the 
basic probability assumptions apply. 


1. Introduction. The following problem arising in chemical research is typical 
of a statistical problem occurring in many fields of activity. A set of inert par- 
ticles is randomly distributed over a microscope slide of area A. It is assumed 


that the probability of m particles falling in a subset of area a is 
P,\(m|a) = e*(ya)” / m! 


and that distributions in disjoint areas are independent. On the basis of particle 
counts in subareas of A, we want to estimate the unknown parameter \. With 
this estimate we can, by performing the indicated multiplication, also estimate, 
if desired, N = AX, the expected number of particles on the slide. In addition, 
we would like to make a confidence statement about the reliability of our esti- 
mate. 

In general, in discussing the reliability of an estimate, we would like to bound, 
at a given confidence level, either the absolute or the relative error of the estimate. 
That is, for a selected estimate ¢ of an unknown parameter 6, we would like to 
say with confidence coefficient a, either 


t#-eas6B or (|t/0—1| Sy, ie, (1—y) St/os (1+), 


where a and 8 (or y) are suitably chosen numbers which do not depend on @. 
In the present problem, it seems reasonable to be concerned with the relative 


Received November 18, 1952, revised May 20, 1954. 
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rather than the absolute error of the estimate, since this error is independent of 
the units in which the area is measured and is the same whether we are estimating 
\, or AX, the expected number of particles on the slide. What we would like, 
therefore, is an estimate / of \ such that we can say with confidence coefficient 
a that 1 does not differ from \ by more than 100y per cent of A, where neither 
a nor y depend on the true value of \. Such an estimate we call an estimate of 
bounded relative error. Moreover, among all estimates of this type we would like 
to choose one possessing some optimal properties. 


2. Fixed area sampling procedure. In estimating \ in problems of the present 
kind, it often has been the practice to select n non-overlapping subareas of A, 
each of size dy) , with n and ay determined in advance, and to count the particles 
in the selected subareas. The usual estimate of \ is then 

1 n 
l = — ry 
Nap i=l 
where x; is the number of particles observed in the ith subarea. It is easy to see, 
however, that for a given value of y, 


Pil — vy) $1 SA + y)} 


is a function of the unknown parameter \, so that no confidence statement con- 
cerning a bound on the relative error of 1 can be made. 

On the other hand, from the observed value of 1, we can determine values for 
two functions L, and L, defined by 


exp {—L,(l)nao} (L4()nao)’ _ 1 — a 


: _ 
7! 2 


exp { —L,(D)nao} (L,(D) nav)? 1a 


y! 2 
Then, we can say with confidence coefficient approximately a that 


A) = r < LQ, 
M1 — y()) < () < M1 + yO), 


’ 


where 


2L, DL. (1) al LAD a L,()) 


i) = 


LA) + 0,0’ — LO + 0" 
It follows that if we estimate by i l), we can say with confidence coefficient 
approximately a that the relative error of I(1) is bounded by y(1), but this quan- 
tity is a chance variable whose expected value depends on the unknown value 
of X. 

Since a fixed area sampling procedure does not yield estimates of bounded 
relative error, we consider the possibility of using another type of sampling 
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procedure. (Some properties of fixed count and fixed time experiments are dis- 
cussed by Albert and Nelson [1] for the case of counter data for radioactivity.) 
By a suitable adjustment of the microscope, the aperture can be increased 
gradually so that the area under observation increases continuously from a 
point to any desired magnitude, within the limits of the area of the slide. 

If the area is expanded only until a fixed number of particles is counted, the 
magnitude of the observed area becomes a chance variable with an attached 
probability distribution. In this distribution, the parameter \ appears as a 
scale parameter. It has been shown [2] that if the single unknown parameter of a 
distribution is a scale parameter, estimates of bounded relative error exist, and 
among them one possessing specified optimal properties can be found. By 
adopting a fixed particle count procedure and applying the general theory, our 
problem is formally solved. 

In the following sections, a fixed particle count procedure is explicitly defined 
and an estimate of bounded relative error is proposed. For the given value of y 
and number of particles counted, it maximizes the value of the confidence co- 
efficient a. Some modifications of this procedure are also discussed. 


3. Fixed particle count procedure. Suppose in counting particles under a micro- 
scope, the area under observation is expanded until a fixed number of particles, 
M, is counted. The magnitude of the area so obtained, say ay , is a chance vari- 
able with an attached probability distribution. Since we have assumed that the 
probability of m particles falling in a subset of fixed area a is 


P,(m | a) = (Aa)"e™* / m!, 


the probability that an area of size z/d has fewer than M = Dertcles is 


M-1 j 


Pd t on 
Piray > 2} =e 2 = la ara =H" 


It follows that \ay = X has the density function 
fu(z) = e 72“ / (M — 1)!. 


That is, A\ay has a gamma distribution with parameter M, or 2\ay has a chi- 
square distribution with 2M degrees of freedom. 

Suppose now that an observation ay is made, and we form the estimate / of 
d defined by 1 = b/ay, where b is a given positive number. If + is the desired 
bound on the relative error, the probability, before the observation is made, 
that 


(i-y)silAps(1+y) 
is given by 


( Db b \ b/1— ft 
<= = ——————,, = 
PY] py Se ST = Las m= pi® ¥+(b, M), say. 
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The quantity y,(b, M) does not depend upon X, so that for all A, 1 is an estimate 
with relative error bounded by y at confidence level y,(b, M). Clearly, we would 
like to find that value of b, say b*, for which y,(b, M) is a maximum. 

As b varies from 0 to «, the value of y,(b, M) increases continuously from 
zero to a maximum value y,(b*, M), and then decreases again to zero. To deter- 
mine b*, we consider 


a a ebii—y gftt =? eb/l+y Paitin’ Ties 
ap ¥1 M) = StI (M — pi” *) (M — sia | 


fi) —b/(1l— y) —b/ (i+) 
(M — 1)! la i Aa > 
‘The maximizing value is the single finite positive value of b for which 
dy,(b, M) /ab = 0; 


that is, 


2 
(3.1) pf a. MO Ps 
2y 1 Y 


For b*, we have 
M-—1 —z 


zx é 


¥,(b*, M) = , dx = y;(M), say, 


2 (M — 1) 
yous: [xs - 7) 


log — M(1 + 7) log pt]. 
2y l1-—y¥ 2y 1-y 
For a fixed value of y, the function y¥ is a single-valued monotone increasing 
function. The monotonicity of y> follows from the fact that ay is a sufficient 
statistic for a, ,--- , @y. It follows, therefore, that if we define My to be the 
least integer such that ¥3(M) = a, then Mo = n,(a) is a single-valued monotone 
increasing function of a. 

With these considerations in mind, we propose the following fixed particle 
count procedure. The desired values of y and a@ are specified in advance, and 
from these we determine M = 7,(a). The area under observation is expanded 
until M particles are counted, resulting in an observation ay . We estimate X, 
the average number of particles per unit area, and N, the expected number of 
particles on the slide with total area A, by 


* = b* /ay, N* = Al*, 


where b* is defined in (3.1). In either case, the relative error of the estimate is 
bounded by vy at confidence level at least a, regardless of the true value of X. 

If our estimation problem is formulated as a decision function problem in 
which our action space is the positive half of the real line, that is, 


fl: OST < ow}, 





280 M. A. GIRSHICK, H. RUBIN, AND R. SITGREAVES 


and our loss function is 


10 if {(l/A) — 1| Sy, 
L(d, l) = 4 « 
(1 if |(l/A) — 1| > ¥, 
the proposed estimate /* is the best invariant estimate. Since the loss function 
is bounded, it is also minimax [3]. The risk when we estimate \ by /* is 1 — ¥4,(M). 


4. Values of y, M, and ¥3(M). Table 1 gives values of y4(M) for four values 
of y and for M < 40. For M > 40, the distribution of (./4\ay —*/4M — 1) is 
approximately normal with zero mean and unit variance. In this case, therefore, 


¥3(M) ~ o( Se ) y) log +7, — V/4M — i) 
y Y ait y 
mn 4M (1 — y) ees aa 
o(y Dy log 5 “ V/4M i), 


where $(u) = [“. (2x) te"? dt. SinceeO < ¥ < 1, we have 


1, 1+y7 Y,7,7 
— =] —_— ene ae cle . 
a T=, tagtstyt 


2 - 2 - 
* aur ( Y Y ff wily ‘2 
M)~® Mi—+-=-+— — M { Ba r 
¥;(M) | vam (5 +2+24)| o| Val (3 3 ir) 
For values of y and @ which are generally of practical interest, a good approxi- 
mation for the required value of M is given by the relation 


(4.1) 


m(%+14).; 
(4.2) /4M (2 + z *) ; 


where z, = ® [3(1 + a@)], that is, 6(z.) — (—z.) = a. 

For y = .10 and a = .90, .95, and .99, respectively, this approximation yields 
the same values of M as those determined from (4.1), while for y = .05, the 
values differ only slightly: 


9 


C 660 | (4.1) or (4.2) 
f 1081 ! 2651 (4.1) 
' 1081 2650 (4.2) 


5. Modified fixed particle count procedure. Instead of expanding the area 
continuously until M particles are counted, it may be more convenient to adopt 
a sampling procedure consisting of k subsamples. In the jth subsample, the area 
is expanded continuously until a fixed number of particles m; is counted, with 

im; = M, and with the provision that areas observed in the different sub- 
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TABLE 1 


r * hd 1 M—1 —z 
Values of y,(M) ™ arab? e dz, 
p= [MOD DM, MOL yg +0) 
fy (1 — ) 





3753 
-4292 
.4746 


2700 .5140 
. 2909 5487 
.3102 . 578 

3281 -6075 
.3449 .6329 


3607 6560 
.0389 é 3757 .6771 
.0405 4 . 3898 6966 
0421 ' .4033 7146 
.0436 ‘ 4162 .7313 


.0450 os 4286 7468 
.0464 i 4404 .7612 
.0477 ‘ 4518 .7746 
0490 .2418 .4628 7871 
-0503 . 2479 4733 .7988 


samples are nonoverlapping. If a)? 


we now estimate \ by 


is the area observed in the jth subsample, 


k 
Ir = ue /& a, 


j=l 


Since each of the an, is independently distributed in a gamma distribution 
with parameter m;, their sum, because of the reproductive property of the 
gamma (or chi-square) distribution, is again a gamma distribution with param- 


eter > m; = M. The theory now goes through as before. 


6. Sequential Procedure. For practical purposes, it may be convenient to 
modify the sampling procedure in the following manner. The desired bound on 
the relative error, y, and the confidence coefficient a are specified, and the cor- 
responding value of M required for the fixed particle count procedure is de- 
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termined. The area under observation is now expanded until a fixed number of 
particles, r, is counted; 1 s r < M. 

Let a, be the size of the area so observed. Subsequently, non-overlapping 
areas of fixed size ca,, where c is a given positive number, are examined suc- 
cessively, and the number of particles in each area is counted. Let »; be the 
number of particles counted in the ith area of size ca, . Sampling stops with the 
nth such area if 


Ne<M-rsN=M-r-+u, 


say, where 


n—l 


N =>) %, N => %. 
t=] t=] 
In this procedure, the quantities a, , v1, «++ , v», and 7 are all chance variables. 
When sampling stops, we estimate \ by 


je = Mt+wd-v)) ity 
2ya,(1 + nc) .- Y 
That is, we estimate \ in the same manner as before, but replace M and a, by 
(m + u) and a,(1 + ne), respectively. We state with a confidence coefficient at 
least as large as a that the relative error of /* is bounded by y. 
This statement is justified as follows: the joint probability density of ; , -- - 
v,,nand X = da, is given by 


—s rl n —Cz ¥E 
at>-- a= —— fi (-="), 


(r — 1)! i=l 


v;! 


(6.1) <4 
z y,<M-r 


i=] 


(The density is with respect to discrete measure for 1,-°-:, vn, m, and to 
Lebesgue measure for x.) This can be written as 


fry + nc) roe _rte—t 


(+N-))! 


The probability before any observations are made that 


(v1, °** 5 Yn, N, 2) = 


‘h(n, ++ ,n). 
* 
1-ysi sity 


is equal to 


- (Ti-y 1+ 7 _ da,(1 + ne) 
— ) ae. er asta’ 
LY Lhw, vrs mP yf Qy log 5 y = r+N 
1 


27 
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oO 


D> DV ht, ae Yn» M)Wr(r + N) 


n=l 


HI) OD hon, rt Yn» 0) 


= ¢@ 


All the sums without indices are over all sequences », , --- , v, satisfying (6.1). 


7. Probability distribution of n. For any sequential procedure, it is of interest 
to determine, if possible, the probability distribution of n together with its ex- 
pected value. For the proposed procedure, this distribution is given by 


g(n) = > h(n, 7, i, 
(r +N —1)!c” 
= LGrinto nl ba 


oc M—r—1 ty} 


> 


ta=M—r tn ;=0 t,_2=0 


ae (r +t, — Ite™ 
Amo (r — 1) Mtn — tea)! «+ (te — &) IIL + ner 
. & FS. ..&te5eeeoe 
tn=M—r ty)=0 (r — 1) ! (tn — tn—1) ! 6.3! (1 ed nc)ttin 
oc M-—r—l (M + ~— 1)! cM" (n a 1) tam} 
imo theme (F — IMM — r+ u — ta) lta! (1 + ne)™"" 
As before, all sums without indices are over all sequences » , --- , v, satisfying 


(6.1). 
Since 


M—r—1 as tami 1 M—r—1 ws u 
ym (n 1) < git-rta t (1 t) dt, 
tnd tna! (M — r + u — ta_s)! 1-1jn (M — r — 1)! u! 


i (M + u — 1)'(ne)¥"™ y  gtnestes at 
g(n) = x WW —+r— ie — flat a wae hua. = Oa 


if (M ae 1) !(nc)"~” pint 
~ Jaye MM — 7 — Dir — DIA F no* 


— (M+u-—1)! ne \* 7 
p> (M — 1)!u! (-#..) (1 — 0 }a 


dt, 


vis [ (M ue 1) (ney 
1-1tjn (M — r — 1)'(r — 1)!C1 + net)™ 
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Writing tr = net / (1 + net), we have 


s (M — 1)! M=—r—1 | 
g(n) = aac (I 1)" dr, 


aoa RM 
1+c(n — 1)’1+en]° 
From this we determine the expected value of n as a function of M, r, and c. 


B(n) = Y ngln) = De oln) + Lon) + -- 
My + 


0 remnant nmenmesen M—r—1 _ Newt 
6 H=NIM = r- DIS - (1 — 7)” dr, 


2) c(n — 1) 
s-[ 2-254]. 


8. Modified sequential procedure. Yet another sampling procedure yields an 
estimate of \ with relative error bounded by y at confidence level a. This is the 
procedure in which we first obtain an observation a, and then count the number 
of particles falling in a given number m of non-overlapping areas, each of size 
ca, , where 7» is determined by the selected values of y, a, c, and r. From the ob- 
servations a, and »,,---, v,,, we estimate A by 


rn ee by 
(8.1) Poor ios |r + ¥».| log 7: 


We state, with confidence coefficient a, that the relative error of i* is bounded 
by y. Consider the confidence coefficient with which we make this statement for 
any arbitrary value of m . The joint probability distribution of a, and »,, --+, 
vn, is defined by the function ¢ given in (6.1), with no restrictions on the values 
of the »v; and with n = mp. Hence, the probability, before any observations are 
made, that 


l-ys */~As1+y7 


is given by 


EE Evi (rt Zen) Mer srreene 


i=l 


and is independent of \. For fixed values of r and c, this is a monotone increasing 
function of nm. By a proper choice of no , the value of the confidence coefficient 
can be made as close to | as desired. 


9. Estimate of bounded relative error for the variance of a normal distribu- 
tion. In all of the sampling procedures yielding estimates of bounded relative 
error for the problem of particle counting, the function ¥* plays an important 
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part. This function also appears in the problem of estimates of bounded relative 
error for the variance of a normal distribution. Suppose we have N independent 
observations x; , --- , y on a chance variable X, where now X is normally dis- 
tributed with unknown mean yp and variance o?. Our problem is to estimate o’, 
and we would like our estimate to be of bounded relative error. 

If w’ = >°Y (x, — #)’, the distribution of w’ / o’ is a chi-square distribution 
with N — 1 degrees of freedom. In this distribution, o° appears as a scale param- 
eter and is the only unknown parameter in the distribution. With the use of 
w, therefore, we can find an estimate of bounded relative error possessing 
specified optimal properties. For given values of y and n = N — 1, the estimate 
é* which maximizes the confidence coefficient with which we state that 


l-ys@#/oeS1+4+y7 


is given by 


= 2yu* /|n log +1], 
i~-4q 
1 —n/2 n—2 - 2 2 
| + ¥} as 79) 12 20%) ¢ 2, x72 d(*) 


"% 


=¥5 (3), 


- [OF tog it nd tog i +7]. 


2y Rg ae Da, 


Thus, making the substitution n = 2M, Table 1 gives us for n = 4, 8, --- , 80, 
values of the confidence coefficient with which we assert relation (8.1). As 
before, we can determine n so that y%(n/2) = a with a chosen in advance. For 
larger values of n, ~/2x2, — +/2n — 1 is approximately normally distributed 
with zero mean and unit variance, and corresponding approximations can be 
made, 
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A CHARACTERIZATION OF SUFFICIENCY! 


By R. R. BAnApuR 
The University of Chicago 
Summary. The main conclusion of this paper can be described as follows. 
Consider a statistical decision problem in which certain structural conditions 
are satisfied, and let 7 be a statistic on the sample space. Then the class of de- 
cision functions which depend on the sample point only through T' is essentially 


complete if and only if 7 is a sufficient statistic. The structural conditions in 
question are satisfied in many estimation problems. 


1. Introduction. In a non-sequential decision problem, let X = {x} be the 
sample space, P = {p} the set of alternative probability distributions on X, 
D = {t} the (terminal) decision space, and L,(¢) the loss incurred in making the 
decision ¢t when the (unknown) distribution on X is p. It is assumed that P is a 
dominated set of distributions (i.e., there exists a fixed o-finite \ such that each 
p in P admits a probability density function with respect to A), and that D is, 
or may be taken to be, a subset of k-dimensional euclidean space (1 S k S ~),. 

For each decision function yu let the corresponding risk function be denoted 
by r,, that is, for each p in P, r,(p) = the expected value of L, in using u. Let 
be the class of all decision functions on X to D. Let T(x) be a function on X 
(onto an arbitrary space Y of points y), and let Dr be the class of all » in D 
which depend on zx only through 7’. The class Dr is said to be essentially com- 
plete if for each uw in D there exists a vy in Dr such that r,(p) S r,(p) for each 
p in P. 

T is said to be a sufficient statistic for P if, for each set A of X and each value 
y of T, the conditional probability of A given T(x) = y is the same for each p 
in P. It is well known (see, for example, [1], [2], [3]) that if 7’ is sufficient for P, 
then Dr is equivalent to D in the sense that for each u in D there exists a pv in 
Dr such that r,(p) = r,(p) for each p in P. 

It is shown in this paper that if the loss function L satisfies condition III of 
Section 2, and Dr is essentially complete, then 7 must be sufficient for P. Con- 
sequently, if III is satisfied, then, for any statistic 7’, any one of the statements 
“Dr is essentially complete,” “Dr is equivalent to D,’’ and “‘T' is sufficient for 
P” implies the other two. 

The following are some simple examples of decision problems in which condi- 
tion III is satisfied. 

EXAMPLE 1. Let P be a finite set, P = {pi1, pe, +--+, pn} say, let D be the 
set {1, 2,---, nm}, and let Lj) = Oif 7 = j and =1 if i ¥ j for i, j = 
1,2,-++,n. 
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In the following examples it is assumed that P is a parametric family of dis- 
tributions, P = {pe} say, where @ is a real-valued parameter, and that the set 
Q of all values of @ is an interval of the real line, say Q = {0:0 < @ < 6}, where 
—-xSH< AS @. 

Examp.e 2. Let D = Q, and let Le(t) = (t — 0)’. 

Examp .e 3. Let D be the set of all those points (r, s) of the plane for which 
6 <rZSs < 6, and let each point ¢ = (r, s) of D correspond to the decision 
that r S 0 S s. Let L(t) = h-Wo(t) + k-(s — r), where h and k are positive 
constants, and W, = Oifr < @ S sand =1 otherwise. 

The preceding examples suggest that condition III is typical of problems of 


inference in which “nuisance parameters” are not involved. This is indeed the 
case. 


2. Results. Let there be given: an abstract space X of points z, and a o-field S 
of subsets of X; a set P of probability measures p on S; a set D of points t, and 
a o-field D of subsets of D; and a real-valued non-negative function L,(t) on 
P X D such that, for each p, L, is a D-measurable function of ¢t. It is assumed 
that (X, S), P, (D, D), and L satisfy the following conditions. 

ConpiT10n I. The set P of measures on S is a dominated set containing at 
least two measures. 

It should perhaps be stated here that in the case when P contains only one 
measure, the conclusions of this paper hold entirely trivially. 

ConniT10n II. The decision space (D, D) is of type (R, R) ({3], Section 7), 
and D contains at least two points. 

Let the closed interval (0, 1] be denoted by /, and let I be the class of Borel 
sets of I. Let p and q be two different measures in P, and define 


(1) a(u, t) = u-L,(t) + (1 — u)-L,(t) 
for uin J and tin D, and 


(2) B(u) = inf {a(u, 1} 


for u in I. We suppose that there exists a function, + say, on J into D, such that 
(i) r is an I-D-measurable transformation, and (ii) 


(3) a(u, r(u)) = B(u) for each u in J, 


In general, the function 7 depends, of course, on the p and gq under consideration. 

An additional condition which we require is that the loss function L be quite 
sensitive, in a certain sense, to the difference between p and g. One condition 
of the type required is that the function 7 of the preceding paragraph be uniquely 
determined and one to one. This condition is, however, unnecessarily strong, 
and it can be weakened, with advantage, as follows. 

Let c:, ¢,°** be an enumeration of the rational points of J, excluding the 
end points 0 and 1. For each i = 1, 2, --- define 


(4) yi(u) = cu + (1 — ¢)-(1 — wu), 0<% < |, 
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and 


(5) 6(u) = c-u/ yi{u), 0 < 6, 


for u in J. We suppose that (iii) if wu and v are any two points of J with u + 
then a(5;(u), r(6:(v))) > B(6:(u)) for at least one 7 = 1,2,---. 

Conpition III. Corresponding to any two measures p and g in P with p = gq, 
there exists a function 7 on J into D which satisfies (i), (ii), and (iii). 

This condition is stated here in a form well adapted to our immediate pur- 
poses. It (or rather, its essential content) can be stated more simply as follows. 
Let T denote the zero-sum two-person game in which the spaces of pure strategies 
of players 1 and 2 are P and D respectively, and the payoff is L. For any p and 
q in P with p ¥ q, let T',, denote the subgame in which player 1 is restricted to 
the two pure strategies p and q and their mixtures. Then III is essentially the 
condition that each subgame I’,, is nontrivial and compact for player 2, in the 
following sense: there exists no ¢* in D such that L,(é*) < L,(t) and L,(t*) s 
L(t) for all t in D; and corresponding to each strategy of player 1, there exists 
a pure strategy of player 2 which minimizes the payoff. An amplification of this 
remark, in the form of a useful sufficient condition for III, is given in Section 3. 

A decision function is a function on D X X, uw say, such that »(C, x) is a 
probability measure on D for each x and an S-measurable function of x for each 


C in D ([3], Section 7). For any decision function y, the risk function r, is defined 
by 


(6) r,(p) = [Af L,(b) ine} dp, 


where, for each z, the expression in { } is the integral of L, over D with respect 
to » with z held fixed. 

Let D be the class of all decision functions. Let T be a function on X onto a 
set Y of points y, and let Dr be the class of all v in D which are of the form 
v = vo(C, T(x)). 

THEOREM. Dr is an essentially complete subclass of D if and only if T is a suffi- 
cient statistic for P. 

Proor. Suppose first that 7' is sufficient for P, and consider a u in D. It follows 


from Theorem 7.1 of [3], using condition II, that there exists a vy in Dr such 
that 


(7) [ u(C, x) dp = [ «. T(x)) dp 


for all C in D and p in P. It follows from (7) and (6) that (for any loss function 
L) rp) = r,(p) for each p in P. Since yu is arbitrary, we conclude that Dr is es- 
sentially complete. 

Suppose next that Dr is essentially complete. Choose and fix p and g in P, 
with p # q, and define \(A) = p(A) + q(A) for A in S. Since p(A) S X(A), the 
Radon-Nikod¥m theorem yields the existence of an S-measurable function, 
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g(x) say, such that 0 S g S 1, and 


(8) p(A)= | gar, (A) = fa-oa 


for all A in S. 


Let c; be an arbitrary but fixed rational number such that 0 < ¢; < 1, and 
for any decision function » define 


(9) Fy = city(p) + (1 — ci)-1,(Q). 
We see from (6) and (8) that the right side of (9) is equal to 


oft Ly dus} 9 dd + - fi La du (1 — g) dd 


\ 
{ 


= [Af legL, + (1 — c) (1 — g)L,) dues dx. 


Hence, for any uy, 


\ 


(10) ru = [aloe {  a€6oCe)), 0 duc} a, 


where a, y; , and 6; are given by (1), (4), and (5). 

Corresponding to the p and q under consideration, let + be a function on / 
into D possessing properties (i), (ii) and (iii) of condition III. Let & be the non- 
randomized decision function which, for each xz, assigns probability 1 to the 
decision 7(6;(g(x))). It then follows from (2), (3), and (10) that 


(11) fe = inf (#4) = | lg) BC) ad. 
aeD x 


It is convenient at this stage to consider explicitly the sample space of the 
values of 7’. Let T be the o-field of all sets B C Y such that 7” '(B) is in S, and 
for any measure m on S denote the induced measure on T by m*, that is, m*(B) = 
m(T '(B)). We have been regarding Dr as a class of decision functions on 
(X, S), but Dz can also be regarded as the class, D* say, of all decision functions 
on (Y, T). In particular, »(C, T(x)) < »(C, y) is a one to one correspondence 
between Dr and D* such that, for any probability measure m on §S, 


r,(m) = LAf. Lin(t) dro} dm = Lif. Lin(t) as) dm* = r,(m*). 


By replacing X, S, p, g, and D by Y, T, p*, q*, and D* in the argument leading 
to (11), and then rephrasing the outcome according to the correspondence just 
stated, we obtain the following result. There exists a non-randomized decision 
function in Dr, 7 say, such that 

(12) 7, = inf {7,}, 


veDr 
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and such that, for each z, » assigns probability 1 to the decision r(é,(f(x))), 
where 


(13) f(z) = A(T(2)) 


and h(y) is a T-measurable function on Y (depending only on p* and g*) such 
thatO SAS 1. 

Consider the decision function ¢ defined above. Since Dr is essentially com- 
plete, there exists a vy in Dr such that r, S r; for each measure in P; in particular, 
r,(p) S re(p) and r,(q) S r:(q). Hence, 7, S 7; by (9). It now follows from (11) 
and (12) that 7, = 7. Using (10) to evaluate 7, (cf. the definition of 7), it is 
easily seen from (11) that this last equality is 


(14) [ O)-al6.0), +6) a = ¥(G)-B(6«GQ)) aD. 


Now 0 S 8(u) S inf, {max [L,(t), L,(t)]} < © by (1) and (2), and0 < y;(u) < 
1 by (4), so that y,;(u) -8(u) is a bounded function of u. Since X is a finite measure, 
it follows that the right side of (14) is finite. Hence, writing 


(15) gi(z) = a(3i(9(x)), r(S(f(@)))) — B(6i(9(x))), 


(14) is equivalent to 
(16) [ vi(g)-ei dd = 0. 


Since yg; = O for each x by (2) and (15), and y.(g) > O for each x by (4), (16) 
implies that there exists an S-measurable set, N; say, such that \(N;) = 0, and 
¢gi(x) = Ofor each z in X — N;j. 

Since in the preceding argument c; is arbitrary, it follows that there exists an 
S-measurable set N (= U;N,) such that A(N) = 0 and such that for each z in 
X — N we have ¢;(x) = Ofori = 1, 2, --- . Hence by the definition (15) of the 
sequence ¢1, g2, °°: , condition III(iii), and (13), we have g(x) = A(T(zx)) on 
X — N. Write ¥,(y) = A(y) and y,(y) = 1 — h(y). Then y,(7(z)) and ¥,(7(z)) 
are non-negative S-measurable functions of z, such that dp = y,T dd and dq = 
y,T dd on S, by (8) and the equivalence of g and hT which we have just es- 
tablished. Hence, by the factorization theorem for sufficient statistics, T is a 
sufficient statistic for the set Po = {p, q}. 

Since in the preceding argument p and gq are arbitrary, we have shown that 
T is pairwise sufficient for P in the sense of [1]. It now follows from Theorem 2 
of [1], using condition I, that T is sufficient for P. This completes the proof of 
the theorem. 

The following comments concerning the condition III are relevant to the 
theorem of this section. (a) If the given loss function L,(#) satisfies III, then so 
does any loss function of the form k(p)-L,(t) where 0 < k(p) < © for each 
p in P. This is as it should be, since essential completeness of a class Dr is in- 
variant under such modifications of the loss function. (b) For each p in P, let 
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D, be the set of all points in D which minimize L,(t). It is easily seen that III 
implies that {D,} is a family of nonempty and disjoint subsets of D. Conse- 
quently, III cannot be satisfied in reasonable formulations of problems such as 
testing hypotheses (except when the hypothesis and the alternative are both 
simple), or estimating a parameter @ such that more than one p in P has the 
same value of @. This, again, is not unexpected, since in such problems the useful! 
concept is that of “sufficiency for the relevant parameter” rather than the un- 
restricted sufficiency with which the theorem is concerned. (c) Condition III 
is not, however, necessary to the theorem. It can be shown by examples that if 
III is not satisfied, the theorem may or may not hold. 

Let @ be the class of all admissible decision functions. 

Coro.tiary. Suppose that @ is a complete class. Then a statistic T is sufficient 
for P if and only tf for each u in @ there exists a v in Dr such that r,(p) = r,(p) 
for each p in P. 

Proor. It follows from the definition of admissibility that if @ is complete, 
then a class Dr possesses the property stated if and only if Dr is essentially 
complete, and the theorem applies. 


3. Condition IV. Let p and gq be measures in P, and consider the function 
F,,(t) = [L,(t), L,(t)] which maps D into the plane. Let J,, be the range of 
Fp. Let us say that a point (r, s) in J»yq is admissible if there exists no (r*, s*) 
in Jp, such thatr* Sr, s* S s,andr* + s* < r+ s. Let K,, be the set of all 
admissible points of J,,. Let us also say that K,, is a complete subset of J,, if 


for each (r, s) in Jy, there exists an (r*, s*) in K,, such that r* <r, s* Ss. 
The terms “admissible” and “complete” are borrowed from statistical decision 
theory, but as used here they refer not to the statistical decision problem, nor 
even to the game I, but only to the subgame I, (cf. Section 2). 

It is well known (and easily shown by examples) that K,, can be the empty 
set, and that even if K,, is nonempty it need not be complete. 

ConpiTi0on IV. For any two measures p and g in P with p ¥ gq, (a) Ky, con- 
tains at least two points, (b) K,, is a closed and bounded subset of the plane, 
(c) Kyq is a complete subset of J,,, and (d) for each (r, s) in K,, there exists 
only one point ¢ in D such that F,,(¢) = (r, s). 

It can be shown that II and IV imply III. An outline of the proof follows. 

Consider fixed p and q in P with p ¥ q. Parts (a), (b) and (c) of IV assure the 
existence of a 7 which satisfies parts (ii) and (iii) of III. The additional conditions 
IV(d) and II (together with the fact that the inverse of a 1-1 Borel measurable 
function is Borel measurable) assure that 7 also satisfies III(i). The construction 
and detailed verification of r is, however, rather lengthy, and it seems best to 
omit it. Note that since the conditions [V(d) and II are used only to assure that 
r is measurable, they are superfluous in applications where the measurability of 
r is not in doubt. 

The conditions II and IV are satisfied in each of the examples of Section 1. 
Indeed, they are satisfied in Example 1 with any Z such that L;(j) > L(t) 
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whenever 7 # j; in Example 2 with Lo(t) = a(6)- | t — 0|*, where 0 < a(0) < 
oo and 0 < b(@) < ©; and in Example 3 with Ly(r, s) = Aoe(r, s) + Be(s — r) 
where Ay = Oifr S 0 S s and > O otherwise, and B,(z) is a strictly increasing 
function of z 2 0 with B,(0) = 0. 


4. A theorem of Elfving. In this section we suppose that there are given (X, S$), 
P, and (D, D), as before, but that the loss function Z is not specified. The class 
Dr is said to be uniformly essentially complete if, for every loss function L, Dr 
is essentially complete. The concept of uniform essential completeness is due to 
Elfving [4]. In his paper, he showed (using a notation and terminology which 
differs slightly from the present one) that if each of the sets X, P, and D is 
finite, then Dr is uniformly essentially complete if and only if T is sufficient for 
P. We shall show that this result is valid provided only that P and D satisfy 
conditions I and II. 

In view of the first part of the proof of the theorem in Section 2, we need only 
show that if Dr is uniformly essentially complete, then T is sufficient. Let p and 
q be two measures in P, p ¥ q, and let Py = {p, q}. It is easily seen that the 
hypothesis implies that Dr is uniformly essentially complete for (X, S), Po , and 
(D, D). Let r and s be points of D, r ¥ s, and define 


(0 ift=r 
L,(t) = i! ift = 
otherwise, 


2 

l fier 

0 ift=s 
otherwise. 


Then, as is easily seen, I, II, and III are satisfied in the problem (X, S), Po, 
(D, D), and L. Since Dr is essentially complete for this problem in particular, it 
follows from the theorem of Section 2 that T is sufficient for Pp = {p, q}. Since 
p and q are arbitrary, it follows from Theorem 2 of [1] that 7’ is sufficient for P, 
as was to be shown. 

Let us say that Dr is uniformly equivalent to D if, for every loss function L, 
Dr is equivalent to D in the sense of Section 1. Let us also say Dr is strongly 
equivalent to D if corresponding to each yu in D there exists a v in Dz such that 
fx v(C, T(x)) dp = Jx u(C, x) dp for all C in D and p in P. Now, it is shown in 
[3] that sufficiency implies strong equivalence. The facts that strong equivalence 
implies uniform equivalence implies uniform essential completeness are evident, 
and we have just seen that uniform essential completeness implies sufficiency. 
Consequently, the concepts of sufficiency, strong equivalence, uniform equiva- 
lence, and uniform essential completeness afford equivalent comparisons of 
Dr and Q, at least when I and II are satisfied. 

The conclusion just stated could be regarded as a strong result in the compari- 
son of experiments in the special case when one of the two experiments being 
compared is a contraction of the other (cf. [5]). By so regarding it, it follows, in 
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particular, that (at least in the case when (X, S) is of type (R, R) and P is 
dominated) a statistic y = 7'(z) is sufficient for x in Blackwell’s sense [5] if and 


only if y is sufficient for z in the classical sense, that is to say, T is sufficient for 
P. 


REFERENCES 


[1] P. R. Hatmos anp L. J. Savaes, “Application of the Radon-Nikodym theorem to the 
theory of sufficient statistics,’? Ann. Math. Stat., Vol. 20 (1949), pp. 225-241. 

{2} Davip BLACKWELL AND M. A. Grirsuickx, Theory of Games and Statistical Decisions, 
John Wiley and Sons, New York, 1954. 

[3] R. R. Banapur, “Sufficiency and statistical decision functions,’?’ Ann. Math. Stat., 
Vol. 25 (1954), pp. 423-462. 

[4] G. E_rvine, “Sufficiency and completeness in decision function theory,’’ Ann. Acad. 
Sci. Fennicae, Ser. A, 1. Math.-Phys., No. 135 (1952). 

[5] Davip BuackwELL, “‘Equivalent comparisons of experiments,’’ Ann. Math. Siat., Vol. 
24 (1953), pp. 265-272. 

[6] A. Waxp, Statistical Decision Functions, John Wiley and Sons, New York, 1950. 





DISTRIBUTION OF THE MAXIMUM OF THE ARITHMETIC MEAN OF 
CORRELATED RANDOM VARIABLES 


By JoHn GURLAND 


Towa State College 


1. Summary. The initial distribution considered here is obtained from a 
multivariate analogue of the Pearson Type III distribution, and the value of 
the correlation is taken to be non-negative. There is obtained here the dis- 
tribution of the maximum in samples of fixed size n from a random variable 
which is the arithmetic mean of k such correlated random variables. This dis- 
tribution is obtained for large values of nm and for large values of k. The ap- 
propriate expressions for the mode and scale parameters are also given. 


2. Introduction. The mathematical model presented here is applicable when 
the aforementioned samples of size n consist of independent observations from 
a fixed population. The distribution function for this population, given by (4) 
below, is obtained by regarding each of the k correlated random variables as 
having the same fixed Pearson Type III distribution, and as having the same 
correlation with each of the remaining variables. Such a model may be relevant, 
for instance, in considering the tensile strength of a substance; in this case n 
is roughly proportional to the number of flaws. Another possible field of applica- 
tion is the investigation of maximum coincident loads in electrical engineering 
problems; but in this case there are difficulties involved in determining the 


appropriate value of n, and in assuming that the n observations of the sample 
are independent. The above mathematical model could be extended to cover 
such situations; however, the present article is confined to the theoretical prob- 
lem stated in the summary. 


3. Distribution of the arithmetic mean of correlated Pearson Type III vari- 
ables. In many problems the primary distribution is skew and is well fitted by 
a Pearson Type III distribution 

—2z/6 dA—1 
(1) pz,(2) = — x > 0. 

Before considering the problem concerning the maximum value of the mean, 
it is first required to ascertain the distribution of (1/k) >2 i, Where each Z; 
has the probability density (1) and there is a constant correlation p between 
every pair of Z’s. The assumption of constant correlation is indeed restrictive, 
but it is most convenient to have a single parameter which measures the over-all 
correlation of the population. It is possible to consider more general models by 
the same methods described below, but the present article considers only the 
case of constant correlation. 


Received September 21, 1953, revised August 17, 1954. 
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The characteristic function of a multivariate analogue of (1) is obtained by a 
device, similar to that used by Krishnamoorthy and Parthasarathy [1], as 
follows. Consider the multivariate normal probability density 


1/2 , 
m1) = Gyaape (— a4): 


where X = (X,, X2, +--+ , Xx) and Q is a positive definite matrix. The charac- 
teristic function of Y = (Xj, X2,--- , Xi) is 


lay? 


dr(t) = jo — 2i°T PR , 


where T is the diagonal matrix 


Bo Gaeulog, 


Then [¢y(t)]” is the characteristic function of a natural multivariate analogue of 
the x’ distribution. Now 


[or())]" = I — 2° TA" 
where J is the unit n x n matrix. Take 4m = } and 20° = @ to obtain 
(2) oz(t) = | — wTa"]>. 


This is the characteristic function of Z = (Z,, Z2, +--+, Z,. where each Z; 
has the probability density (1). Note that in (2) any positive de. nite covariance 
matrix Q is permissible. The special case of interest here is 


Qt = 


pp 

The characteristic function of (1/k) >>iZ; is then 

i. — ite/k —itOp/k +++ —itdp/k (> 
| —itp/k 1—i/k --- —itOp/k 


sO 
(3) 


—it0p/k  —itOp/k --- Ll— ‘oad 
[1 — (it0/k)(1 — p)]~* [1 — (t0/k)(1 — p + pk)”. 
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On referring to tables of Fourier integrals [2], one finds that the probability 
density is 


(4) p(x) = 1 po exp | —& | iF; (. kr — phx ar) 
c A(1 — p) *" @(L — p)(L — p + pk) 


where 
(5) ce = (0/k)“ (1 — py“? (1 — p + pk)* TAK) 
and where ,F;(w, v, z) is the confluent hypergeometric function which is given by 


a see we, ww +l) 2 
Fi(w, v, 2) = 1 + = i 7 vv + 1) 2! 


ww + 1)(w + 2) 


T e+ Dw ¥ 2) 


2 

eto 
For p = 0 the density given by (4) reduces to that of an arithmetic mean of 
k independent random variables, each of which has the density given by (1). 


4. Asymptotic solution of a certain transcendental equation. Before obtaining 
the large-sample distribution of the maximum value in samples from (4), it is 
necessary to consider the solution of the equation 


(6) z—-Alogxr+B=C 
where A, B, and C are constants and the value of C is large. If A < 0 there 
is one solution but if A > 0 there are two solutions, as is easily seen by con- 
sidering the intersection of y = x + Band y = A log x + C, and noting that 
C — o. However the only solution required here is the one which increases 
indefinitely as C increases indefinitely. 

Let r(x) = x — A logz + B — C. Now dr/dx = 1 — A/x > O for = large, 
which shows that the function is monotonically increasing for « sufficiently 
large. As a first approximation to a solution, try 


(7) 1 =C—B+A log C. 
This makes 


Y 
ax 
— 


( 


/ 


B — AlogC\ _ A (B — A log C) 
B ; 


(&) riz) = —A log {1 -- 


for C large. This approaches zero as C — . It can, in fact, be shown that the 
solution satisfies 


(9) «c=C—B+AlogC + o(1/C‘) 


where ¢ is an arbitrarily small number such that 0 < ¢ < 1. This is easily seen’ 
by noting that for C large enough, 


r(C —B+AlogC — 1/C*’) <0, 
r(C —-B+AlogC+1/C"“’) > 0. 


1 The author is grateful to the referee for simplifying this proof. 
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If B as well as C is permitted to increase indefinitely, the asymptotic solution 
(7) remains valid, provided B/C — 0. This requirement is obvious on referring 
to (8). However, the order of the approximation indicated in (9) may not be 
valid for this case. A sufficient condition for the validity of (9) is B = O (log C). 


5. Large sample distribution of the maximum for the case p > 0. The main 
problem now is to find the distribution function of the maximum in samples of 
size n from the population characterized by the density (4). In this section the 
distribution will be obtained first for large values of n, then for large values of 
k and n. The two cases p > 0 and p = 0 will also be treated separately. 

Let F(x) = fop(t) dt where p(t) is the density (4). Then [F(x)]” is the dis- 
tribution function of the maximum in samples of size n. The probability density 
of the maximum is 


(10) g(x) = nF*™"(x) p(z). 
Apply the transformation 
(11) Yn = n{l — F(z)). 


Then, if X is the random variable with density (10), and Y, is the random 
variable defined by (11), Y, has the probability density function 


py,(z) = (1 — 2/n)"", 


and it will be noted that lim,..py,(z) = ¢*, for x > 0. In solving (11) for z 
as a function of y, , it will be shown below that X = —a log Y, + 8 for large 
values of n (and hence large values of x). Consequently, by a limit theorem of 
Mann and Wald [3], the limiting distribution function of (X — 8)/a is the 
distribution function exp(—e*). 

We now proceed to solve (11) for x. The equation may be written 


(12) Yn = “| t* exp | ams | iFi(A, kA, dt) dt 
where 6 = pk”/0(1 — p)(1 — p + pk), and c is given by (5). 

Since the distribution will be considered for large values of n it will suffice 
to find the solution of (12) which is valid for large values of x (cf. Fisher and 
Tippett [4], Cramér [5]). In evaluating the integral in (12), the following proper- 
ties of the conflue t hypergeometric function will be required. 


(13) £ iF (w, v,z) = = iFi(w+i1, »v+1, 2); 


(14) iFi(w, v, 2) = eT) E + o(2)], 2 0. 


rT (w) 


Formula (13) follows from the definition of the confluent hypergeometric func- 


tion (cf. [6]). A more general asymptotic expression than (14) is established by 
Barnes [7]. 
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Integrate (12) by parts and apply (13) to obtain cy,/n = a; + a + as, where 


a a1 = p) 2 exp laa | F(A, kA, 52), 


oo 2B [ get exp lain | iF,(A, ky, bt) dt, 


a= 


60(1 — p) Ak—1 kt 
Qs Sel t exp | im | iFi(A + 1, kA + 1, dé) dt. 


Now define a = 6(1 — p + pk)/k. In virtue of (14), and for p > 0, these 
expressions reduce to 


— 1 — p) 2" TURD) (sr 
~ ko PD TA) lt +0(g =) 
— 91 — pla kd — 1-2 T(RA) (sia 
k §rte—-D * TO) © oe | 


61 — pa 2** (KA) -s10 
e k pe-D=i TA) 1+0 
As a result of the above simplification it is now possible to write (12) in the 
form 


_ 61 — p) (1 + ba)x** PKA ele 
A f xe E +0(2 ‘)]. 


TAQ) 


A further simplification is afforded by referring to the representation of ¢ in (5). 


Thus 
Ci yn BB 2 6 —z/a E + o(2 1)I, 
n 6x 


Ak—1 
a 


cq = (1 a ret) —p + aro). 
l+a 
It now remains to solve (15) as a function of y, . If the terms involving O(1/éx) 
are neglected, and we take logarithms of the remaining terms, we obtain 


(15) 


(16) x — a(r\ — 1) log xz + a(log y, + loge) = a log n. 
Define 
(17) A = afd — 1), B = a(log yz + log a), C = a log n. 
If k is small and n is large, then the solution (9) is applicable. Hence 
(18) t= —a log yx + 8 + 0 (1/[a log n)'‘), 

= a [log (n/c) + (A — 1) log (a@ log n)}. 


Thus, the limiting distribution of (X — 8)/ais exp (—e™~*), and a and 8 re- 
present respectively the scale parameter and the mode for large values of n. 
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If k as well as n is allowed to increase indefinitely, then in order to apply the 
results of Section 4, the expressions for B and C in (17) must be altered. In fact, 
equation (16) may be written as 


xz — a(\ — 1) log z + a log y, 
+ a log {2° (1 — pY*P"( — p + pk Pe PA)} 
= a log [kh "*{k + 50(1 — p + pk)}] + a logn. 

Now if we define 

B = a log yn + a log ("1 — py P"'(L — p + pkyP* Tay}, 

C = alogn + a log [k**{k + 80(1 — p + pk)}), 
it follows, since 6 is of the order of magnitude of k, that 

B® adk log k, C ®& adk log k + a log n. 


Hence, if k log k = o(log n), it follows that limy......B/C = 0, in which case 
(9) is applicable. This gives x = —a log y, + 68, with a = O0(1 — p + pk)/k 
as in (16) and 


(19) 8 = a log (n/c) + a(A — 1) log {a log [n(ka)”*(k + 60(1 — p + pk)]}. 


6. Large sample distribution of the maximum for the case p = 0. The dis- 
tribution of the maximum for the case p = 0 must be considered separately be- 
cause 6 now reduces to zero, and the approximations employed in the evaluation 


of a; , x2 , and a; are no longer valid for large values of x. With p = 0 the density 
in (4) becomes 


kh\—1 —ka/0 


p(z) = =2 lian z>0 


where now c = (0/k)“I(Ak). The same method as that described above is 
applied to 


mat [eet a 


For large values of x this yields 


cyn/n = (6/k)e "2 "{1 + O(1/z)). 
As before, neglect the terms involving O(1/z) and take logarithms of the re- 
maining terms to obtain 
(20) a — a(kd — 1) log z + aflog yn + log {a “T(Ak)}] = a logn 
where now a = 6/k. If k is small and n is large the solution becomes again 
xz = —alog y, + 8, where now 


‘ 
(21) B =a log 2=T0p,+ a(ky _ 1) log (a log n). 
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If k as well as n is allowed to increase indefinitely, then, as before, the ex- 
pressions for B and C must be altered to apply the results of Section 4. Rewrite 
equation (20) as 

z — a(ky — 1) log z + @ log yn = a@ log n — a log [a* “T(Ak)]. 
If we define 
= log pei ' 
o*—!1'(\k) 
then lims..c.n+0 B/C = 0 if a*“T(ak) = o(n). We obtain, as before, 
xz = —alog yn, + B, 


B = a log yn, C 


where now 


(22) B = alog 2FTE) + a(krA — 1) log {e log atm 

6. Conclusion. A few remarks are in order regarding the results obtained in 
this paper. Firstly, the cases in which both k and n are allowed to increase in- 
definitely require that k should not increase too rapidly relative to n, if the 
results obtained are to remain valid. Further, the order of approximation for 
these cases need not be the same as for the cases in which only n is allowed 
to increase indefinitely. 

Secondly, some extensions of the results obtained here require further re- 
search. For instance, as has already been stated, the correlation need not be 
the same for all pairs of Z,’s. Further, the initial distribution from which samples 
of size n are taken need not be fixed but might change during the course of the 
sampling. This might further be complicated by the fact that the observations 
in the sample could be correlated. 
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SIGNIFICANCE PROBABILITIES OF THE WILCOXON TEST 
By Evetyn Frx! anno J. L. Hopags, Jr. 
University of California, Berkeley 
0. Summary. Tables are presented from which exact values of the Wilcoxon 
distribution may be obtained when the smaller sample size m does not exceed 


12. The Edgeworth approximation to terms of order 1/m? is given and its ac- 
curacy investigated. 


1. Introduction. We are interested in the problem of obtaining significance 
probabilities for the Wilcoxon unpaired two-sample test [1], [2]. Let m < n be 
positive integers, and let Ri < R, < --- < R,» represent a random sample of 
size m drawn without replacement from the first m + n positive integers. Let 
S; = R; —iand U = 8,+ S.+ --- + Sa, and let r(u, m, n) denote the dis- 
tribution function of U. It is the values of the function that are required in 
the Wilcoxon test. [Wilcoxon actually considered W = Ri +---+ Ru = 
U + 4m(m + 1)}. 

Mann and Whitney [2] have tabled’ x to 3D for n < 8, and have shown that 
x, suitably normalized, tends to the normal as m, n —> «. White [3] has tabled 
the largest value of u for which r(u, m, n) S 0.005, 0.025 for m + n S 30. 
Auble [4] has published a similar table for m, n S 20, and significance levels 
0.001, 0.005, 0.01, 0.02, 0.025, 0.04, 0.05, 0.1. These tables and the normal 
approximation serve most ordinary needs in hypothesis testing. For some pur- 
poses (such as relative efficiency studies, in which it is the relative erro: that 
matters) the normal approximation is not sufficiently precise, and the restric- 
tion n S 8 of the Mann-Whitney table is confining. The White and Auble 
tables give significance probabilities in most cases with even less accuracy than 
the normal approximation. [[3] contains several errors of one, apparently due 
to rounding. | 

The connection of x with a partition function is well known. If A(u, m, n) 
denotes the number of ways (without regard to order) in which it is possible 
to choose exactly m nonnegative integral summands, none greater than n, whose 
sum does not exceed u for, equivalently, the number of ways in which it is 
possible to choose exactly m positive distinct integral summands, none greater 
than m + n, whose sum does not exceed u + 4m(m + 1)], then 


r(u, m,n) = A(u, m, n) / (" + ~ 


m 
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2 According to a review in Mathematical Tables and Other Aids to Computation, Vol. 6 
(1952), p. 157, this table has been extended to n = 10 with 7D by H. R. van der Vaart. His 
table does not seem to be widely available in this country. 
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Simple recursion formulas permit the ready tabulation of A, but the problem 
of publication is formidable. The usefulness of a triple-entry table of exact 
values of A over the range of interest would scarcely justify the many pages it 
would require.’ 

Wilcoxon [1] presented without proof a formula which, for small values of u, 
permits one to obtain values of A from those of the double-entry quantity 
A,(u, m) = A(u, m, ©). This function A» was studied and tabulated by Euler 
[5]. [More precisely, Euler tabled ao(u, m) = Ao(u, m) — Ao(u — 1, m), which 
is the number of ways of partitioning the exact value u into m parts.] In Sec- 
tion 2 we derive an identity similar in nature to that of Wilcoxon, but valid for 
all values of u. We also present tables of Ao and of a related quantity A, , from 
which values of A are readily obtained. 

Our tables may be used provided m S 12. This requirement that the smaller 
sample size not exceed 12 is considerably less restrictive than that the larger 
sample size not exceed 8, but still will leave many situations of interest un- 
covered. We turn therefore to approximations, and develop in Section 3 a 
polynomial expression for the sixth central moment of U. This permits us to 
obtain simple formulas for the coefficients of the Edgeworth series for + to terms 
of order 1/m’. A numerical investigation indicates this series to be reliable to 
about 4D when m = 12. 


2. A combinatorial identity. To simplify notation, we adopt the conventions 
that A(u, m,n) and Ao(u, m) are 0 when u < 0, and that all variables of sum- 
mation are integers. We observe 


(1) A(u, m,n) = Aolu,m) — > Au —t, m—1, 2). 


This formula may be verified by observing that Ao(u, m) counts the partitions 
S,;+ Ss+---+S, S uwhereO < 8S; S --- S Sn, while A(u, m, n) counts 
those of these partitions satisfying the additional restriction S, S n. Since 
A(u —t, m-—1, f#) is the number of the partitions with S,, = ¢, the sum in 
(1) represents just the number of partitions counted by Ao(u, m) but not counted 
by A(u, m, n). 
We now apply (1) to itself repeatedly, obtaining the development 
A(u, m,n) = Ao(u,m) — ze Adu — th, m-—1) 
t 


>" 


(2) a Zo Apu —t—t, m— 2) 


te>ti>n 


Afu-t—-k—-k, m—3)+ OF 5 
tg>te>ti>n 
This formula may now be simplified by the change of summation variable 
8; = t; — n — i. If we write u — kn — 4k(k + 1) = w, the (k + 1)st term on 
3 Auble has attacked this problem by placing a table covering m, n < 20 on file with the 
American Documentation Institute, from whom it may be purchased for $4.25 (microfilm) 
or $12.50 (photostat). See [4], p. 14 for details. 
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the right side of (2) may be written 


(—1)* Do Adlw—(s +--+ +8), m— kl. 
OS*1S°**S% 
In this sum, the term Ay(w — v, m — k) occurs as many times as there are 
ways of partitioning v into just k nonnegative integers, that is, ao(v, k) times. 
The (k + 1)st term of (2) is thus equal to 


(—1)* a ao(v, k) Aolw — v, m — k). 
If we now write ; 
(3) Ai(u,m) = 2 ao(v, k) Aolu — v, m), 
we can present (2) in the form i 
A(u,m,n) = De (=1)*An(u — kn —3k(k + 1), m — k) 


(4) = Ao(u,m) — Ailu -n—1, m— 1) 
+ A:(u — 2n — 3, m — 2) — A;(u — 3n — 6), m—3) +-:-. 


The series is extended until the first argument becomes negative. Formulas (3) 
and (4) express the restricted partition function A in terms of the unrestricted 
partition function Ao. 


We present in Table I the values of Ao(u, m) for m S 12 and u S 100 [Euler’s 


table of ao covers m S 20 and u S 59]. These values were computed with the 


aid of the familiar recursion relation 
Ao(u, m) = Ao(u, m— 1) + Agu — m, m), 
together with the boundary values Ao(0, m) = 1 and Ao(u, 1) = u + 1. Values 


of A; for k > 0 can be computed from the relation 


u 


A,(u, m —k) = & Aolv, k) Ao(u — v, m — k) 


v0 


- > Ao(v — 1, k) Aolu — v, m — k), 
v=0 


but for convenience we also give in Table II the values of A2(u, m) for m S 11 
and u S 75. Table II was computed with the aid of 


A.(u, m) = Ao(u, m— 1) + Ax(u — m, Mm), A,(0, m) = 1, 


which follow from (3). In the use of (4) for the range covered by our tables, 
one often needs A; and occasionally A;. These quantities are readily obtained 
from Table II with the aid of 


A,(u, m) = Ao(u, m) — Ao(u — 2, my) 


(5) 
A;(u, m) = A,(u, m) + Ax(u — 3, m) + Alu — 6, m)+---. 
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These relations are so simple to use that tabulation of A; and A; is unnecessary. 
Values of A, for k > 3 are seldom required. In general, 


Ai(u, m) = Dorzo Asa(u — rk, m). 


We illustrate the tables by computing 2(95, 12, 22). Using (5) and the tables, 
we find 


Ao(95, 12) = 124,610,703, 
A,(72, 11) = 358,414,629 — 277,080,764 = 81,333,865, 
A,(48, 10) = 9,263,517, 
A;(23, 9) = 49,969 + 22,012 + 8,968 + 3,317 
+ 1,080 + 296 + 62 + 8 = 85,712. 


Combining, A(95, 12, 22) = 52,454,643; this is the exact number of partitions 
of 95 + 312-13 = 173 into just 12 distinct parts between 1 and 34 inclusive. 
Since (73) = 548,354,040, we find (95, 12, 22) = 0.095 658 --- . 


3. Approximations. Our tables provide values of x(u, m, n) only for m S 12 
and u < 100. As is shown below, the normal approximation at these limits is 
subject to sizable percentage errors. In the search of better approximations for 
m > 12, we turn to the Edgeworth series, which to terms of order 1/m’ is (taking 
advantage of the symmetry of U) 

(6) a(u, m,n) = &(x) + Ee (xz) + Ore (2) + LO.~ (x), 


where x is the normalized value of u. Using E(U) = 4mn and pw, = 
mn(m + n + 1) / 12, and the usual continuity correction, we take 


(7) x= (u+ 3 — 3mn)/Vmn (m+ 2 + 1) / 12. 
The Edgeworth coefficients are given by 


Pa - i (4-3), oi, = 1 (M — 15 + 30), 
4! \us 6! \ui Ba 
(8) 


where yu; is the kth central moment of U. Mann and Whitney give 


Mm = moe th [5(m?n + mn?) — 2(m? + n”) + 3mn — 2(m + n)). 


They show [their formula (14)] that 


(9) ws = as S| [35m'?n?(m? + n®) + 70m'n® + P(m, n)] 


where P(m, n) is a symmetric polynomial of 5th degree in m and n. When m = 1 
and m = 2, the distribution may be given explicitly and the moments deter- 
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mined. In this way it may be shown that 
P(m, n) = —42 mn(m* + n*) — 14 m’n?(m + n) + 16(m* + n‘) 
(10) —52 mn(m? + n*) — 43 m’n® + 32(m*> + n°) 
+14 mn(m + n) + 8(m + n?) + 16 mn — 8(m + n). 
If we substitute (10), (9) and (8) into (6) we find after simplification 
m+n +m+mt+n w 
~~ 20mn(m +n+1) * (x) 


[2(m* +n‘) + 4mn(m? + n”) + 6m'n? + 4(m*® + n°) 
+ 7mn(m + n) + (m? + n?) + 2mn — (m+ n)) & 
OS ee 


(m? + n° + mn + m +n)" Oy) 
~ 800m2n?(m + n + 1 )? , 

An appreciation of the accuracy of the Edgeworth approximations at the limit 
m = 12 of our tables may be gained from an examination of Table III. Column 
(a) gives the normal approximation; column (b) the first two terms of (11); 
column (c) the entire approximation (11); the last column gives the exact value. 


r(u,m,n) = &(x) — 


oe 


TABLE III 


(a) (b) ™ (%, m,n) 





17359 . 17368 
-01671 d .01662 
-00285 .00280 .00278 


.07303 .07308 .07307 
01588 .01584 .01583 
.00254 .00202 .00199 .00198 


It appears that (11) may be relied on to about 4D when m = 12, and its ac- 
curacy should improve with large values of m. The normal approximation (a) is 
subject to large percentage errors at the high significance levels, and is much 
improved by the use of the simple term in ¢“(z). 
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ON THE FOURIER SERIES EXPANSION OF RANDOM FUNCTIONS' 


By W. L. Roor anp T. S. Prrcuer 


Massachusetts Institute of Technology 


Problems involving stationary stochastic processes are often treated by ap- 
proximating the original processes by Fourier series with orthogonal random 
coefficients.” In this paper we justify this technique in certain instances. 

We let x(t) denote a real- or complex-valued stochastic process defined for all 
values of ¢. We assume the first and second moments of x(t) exist. We write 
R(s, t) = E{x(s)x(t)} and in case R(s, t) is a function of (¢ — s) only (that is, 
x(t) is stationary in the wide sense), we write p(t — s) = R(s, t). We assume every- 
where that E{x(i)} = 0. 

We define the stochastic process x(t) to be periodic if the random variables 
x(t,) and x(t; + 7) are equal with probability one for all 4; and some constant 7’. 
If x(t) is periodic, then R(s, t) is periodic in each variable. If x(t) is wide-sense 
stationary, then it is periodic if and only if p(r) is periodic. 

Our first result follows from the theorem due independently to Karhunen 
and Loéve which states: Let x(¢) be continuous in the finite interval (a, b), then 


a(t) = Lim. > xivi(t) 
n+ 1 
where the y,(¢) form an orthonormal system over (a, b) and where E{z,z;} = 
\,6;; if and only if the ¥;(¢) and the \; are a system of eigenfunctions and eigen- 
values of the integral equation 


[ R(s, )~@ dt = »V(a)- 


TuHeorem 1. Let x(t) be a wide-sense stationary stochastic process continuous in 
mean square. Then 


n tkwr 
° re 2 
x(t) = Lim. he: JT ’ o= a 
on the interval (0, T'), where the x; are pairwise orthogonal if and only if x(t) is 
periodic with period T. 
Proor. From the Karhunen-Loéve theorem and the remark above about 
periodicity it follows that we need to show that 


Received August 17, 1954. 
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I R(s, )y(d) dt = A¥(s), 0 T, 


is satisfied by exponentials when and only when p is periodic with period T. 
Suppose the y,’s are exponentials, that is 


v(t) = a" / +/T, w = 2r/T, n; an integer. 


Then by Mercer’s theorem }> je'"/“‘e~*"** / \;T converges uniformly on [0, T] X 
[0, T'] to R(s, t). Now since the process is stationary, R(s, t) = p(s — 1). Hence 
>" / \;T converges uniformly to p(r) in the interval —7 < + < T. Thus 
p(0) = p(T’) and since x(t) is wide-sense stationary, E{\x(t + 7) — x(Z)\*} = 
p(0) — 2R [p(T)] + p(0) = 0. The converse is obvious. 

Davis [2] shows that if z(é) is a wide-sense stationary, continuous-in-mean- 
square process which has for every T > 0 a Fourier series, with random orthog- 
onal coefficients, which converges to x(t) in mean square over (0, 7’), then x(t) 
must be the trivial process with p(r) = const. This assertion follows from 
Theorem 1. For to satisfy Davis’ hypothesis, p must satisfy p(t) = p(t + 7), for 
every T' > 0, and the only functions with this property are constants. 

A somewhat different problem from the expansion of a random function is 
the statistical representation of a random function. In particular, here we ask 
for conditions under which a sum of exponentials with orthogonal random co- 
efficients has the same multivariate probability distributions as a given random 
function. If z(¢) is Gaussian, such a statistical representation essentially always 
exists. 


THEOREM 2. Let x(t) be a stationary Gaussian process with correlation function 


p(r) = Lim. as. _ = aia Z wo = ae 


9")? mm” ° 
mo -—n 2 7 


Then there exists a process y(t) defined by 
y(t) = Lim. >> me 
where the x, are complex Gaussian variables satisfying 


E{x} =0, E{aa.} = ce, E{xax;} = 0, k #j,t% = 22, 


which has the same multivariate distributions as x(t) over the interval 0 S t S 3T. 
Proor. It is easily verified that if y,(f) = }-",ne*""*, then 


lim E {|ym(t) — ya(t)|?} = 0, 


so the process y(t) is defined. Then 
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E{y@®y(t os r)} — P Lo Eltezajete rr ia Xu { |xa|*}e" ™*" — p(—7), 


which proves the assertion. 

For any T < «, J" E(\x(t)|*) dt = 2TE(\x(0)|") < «, so that, by Fubini’s 
theorem, x(t) is almost always measurable and square integrable on (—T, T). 
Hence for almost every sample function z(t) = lim. }°c,e'""“” on the interval 
(—T, T), where c, = (27) Jr x(s)e""*’” ds. The process defined by the two 
preceding equations has the same (a.e.) sample functions as the original on 
(—T, T). We have 


—_ rT Tr > 
Elea,) = (27) 7 ds [ dt eX" O/T (gp _ 1) 
(1) (ot) T 


1 
- (47) [ dv Sor du elt o(u) 


v—1)T 


where we have made the substitutions u = s — ¢tand thenv = s/T. Ifn = m, 
then the integrand is dominated by 27'p(0), so that E(\ca|’) S p(o). That is, 
c, is square integrable and E(c,c,,) exists. 

Tueoreo 3. If p is integrable on (— ~, ~) and J”, p(t) dt ¥ 0, then E(\c,|") = 
O(1/T) > 0. If alson ¥ m, then 


i E(CnCm) 
——______~___ = (). 
ree (E((eal?)E (cml) 
Proor. The functions fr defined by 
(v+1)T 
fr(v) = du e"*™'* o(u) 


v—1)T 


are uniformly bounded by f=, |p(u)| du. For every v, with |x| < 1 


fr(v) +f p(u)du=a as T-> ~, 


Hence, by Lebesgue’s theorem applied to (1) 


n ~~ ™m, 


n=>m 


0 
lim (4T)E(Catmn) = 
2a 


This implies the result. 
TueoreM 4. If p is square integrable and r is its Fourier transform, then 


pgecctge (a ge ot) sin’ u 
@) Beste) = a [or (Spe ula + (n= myx] 
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PRoor. 


a 1 1 7 (v+1)T : A 
E(Cnem) = aT - er” / du melt im, dw eorr(u)h 


v—1)T Ae A 


(e+)T r A 
sed dv e—™” lim | du grat [ dw e¢ *“’r(w) 
po (v—1)T A 


af ir(n—m)v i(mx/T—2ew) oT sin 2rwT dw 
li “2 —— 
rT. ela (mx/T — 2mw) 
oa (—1)"" 7 of sin 2rwT [ ixv(n—2wT) 9, 
= lim - oT ‘ r(w) (mx/T — 2xw) dw é dv. 
This yields the formula of the theorem. The first step is justified by an applica- 
tion of the Schwarz inequality, the second by the Fubini theorem, and the third 
by the Lebesgue bounded convergence theorem and the Fubini theorem. 


TuHeoreM 5. If p is square integrable on (—~«, ~) and its Fourier transform 
vanishes almost everywhere in (—e, €) for some « > 0, then 


E (Cn Cm) = se 
lim Tie) Bde = 6) 


for all n and m. 


Proor. By the hypothesis, the integrand in the previous theorem vanishes 
if —2reT — nx S u S 2xeT’ — nz and, for any @ < 1, T can be chosen so 
large that outside this interval 


sin’ u 6 sin’ u 
uu+t(n—m)r]~— wv ~* 


Since r(x) = 0 for almost all z, for large T 


; € a 7) sin? u i (“ nm) sin’ u 
QnT J ulu + (m — n)x| = 2nT w 


Integrating this gives (—1)"*" E(cac,) 2 @ E(\ca\°). Hence for large enough 7, 


1 > (DE Ca en) 


= [F(eP) (eal = ° 


which implies the result. 

Instead of holding n and m constant, we can fix the frequencies associated 
with them, that is, set n = a7’ and m = bT. 

TuroreM 6. If p is integrable and square integrable, if a is a real number and 
p * q are integers such that r(a/2) > 0 and r((q/p)(a/2) > 0 then 


iti eS: k = 0,1,2, -- 


i EC Cok Ct) : 
Tyre [E(\ le pe| (2) B(\cqu!*) }? a 


In fact the expression above is O({(p — q)k\~*) for any @ < 1. 
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Proor. By Lebesgue’s theorem and the boundedness of r, which follows from 
the integrability of p, 


4rpk 2 _ [°_ (au , a\ain’ u a\ [*sin'u, _ a 
sr nilent) = [7+ (S+§) Se du r(8) f° 2 au = 2(5). 


Similarly, (Aap / qa) E\(\cq|") —> r( (q/p)(a/2) +). Thus 


< a sin’ u 
E(cn ce) -| [. (ee +t 5) a + @— Pike 


b __sintu __| 
<=; max |r(z)| [ | ate +(q- per] |? 


But the above integral is O(([¢ — p]k)~*) for any @ < 1. 


THEOREM 7. If p is integrable and square integrable, a is a point at which r(a/2) > 
0, and p and q are integers, then 


° _E (Cy Cong ) 
lim sil = 0, _ 0,1, 2, -:- 
Tp [E(\epe| 2) E(|epe+¢l?))'” 


Proor. As above, we get lim (4rpk/a)(E\c,x|") = lim (4rpk/a)E(\ep4¢\) = 
r(a/2)xr. Now 


4arpk a acl t ee = | 
. mar E(Cyk Cpe+¢) | = io (, * 5) u(u + gm) 


“|. f (s+ 5) - (5) anes | 


since f*., (sin’ u) [u(u + gr)]' du = 0 for g ¥ 0. But by Schwarz’s inequality 


| \2 2 2:3 

| 4epk 4, ——, | [ sin’ v “| (se +$) (¢)| sin’ u 
——E < one io 

| a (CoiCne +4) = Le v(v + gm) qt 2rpk 5 . 2/| w o 


The second integral approaches zero by Lebesgue’s theorem. 

It is easy to find examples of processes, vanishing around a/2 and b/2, for 
which the conclusion of Theorem 6 is false. Suppose, for example, r vanishes in 
the intervals 


G=s). @G)I 
US)G)- GSR) a> vr aero 


Then ¢E(\cy\") — |E(cpecge)| < 0 and eB (\cg|") — |E(cocp)| < 0. 

Since the coefficients corresponding to frequencies x with r(x) = 0 tend to mis- 
hehave, it is desirable to show that the total effect of such coefficients is small 
for large 7’. The following theorem does this for the band limited case. 

Txueoreo 8. If r is bounded and vanishes outside (— A/2, A/2) for some A, then 
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[ 2, Bel) + 2 Ble) |/X Blea!) = 0( 98-7), 


AT}+1 T? 


where [AT] is the largest integer not exceeding AT. 
PRoor. 


> Eel) = E( [wor it) = [Bef at = 27960), 


< » we 1 f*_ (ut nr\sin’ u 
Ftd = 2 ap Lo (he) 


T)+1 [AT j+1 2nT uv? 


Ve Fr sin’ 2xvT 
3 ein 7) (QuvT + nz)? -. 


< 1 i< 1 
Bi (QeroT + nx? = x (2oT + [AT] + 1 + 7)? 


. * (log 1)” (207 + [AT] + 1), 


< A/2 
: > El\en|*) = | r(v) sin’ 2rTv(log T)” (2vT + [AT] + 1) dv 


T)+1 


A/2 
< up () [ (log T)"(2uT + [AT] + 1) dv 
Tv A/2 


[AT]+1+4T 


_ sup (r) 
4nT Jiariqi—ar 


{[AT]+ATH1 
s= () (og T)’(w) =0 (‘2s r). 


= 2T [AT]—AT+1 7 


(log T)” (w) dw 


The other case is handled similarly. 
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A CHARACTERIZATION OF THE GAMMA DISTRIBUTION 


By EvGene Luxkacs 


Office of Naval Research 


1. Introduction. The sum and the difference of two independently and iden- 
tically distributed normal variates are uncorrelated and therefore also inde- 
pendent. Conversely, we can conclude from the independence of the sum and 
the difference of two identically and independently distributed random variables 
that both these variables are normally distributed. In this manner one obtains 
a characterization of the normal distribution. 

We denote in the following by 


0 xz <0, 
(1.1) F(z; a,\) = 


BY z 
Qa —1 —at 
—- t t 
Fa end z>0, 


the distribution function of the gamma distribution. The corresponding charac- 
teristic function is 


(1.2) f(t; a, ) = (1 — it/a)™. 


Here a and \ > O are two parameters. It is seen easily that a is a scale parameter. 

Let X and Y be two identically and independently distributed random varia- 
bles each having the distribution (1.1). It is known [2] that in this case U = 
X + Yand V = X/Y are also two independent random variables. 

In the present paper, we use this fact to derive a characterization of the 
gamma distribution which is similar to the characterization of the normal dis- 
tribution mentioned above. Our result can be formulated in the following manner. 

THEOREM. Let X and Y be two nondegenerate and positive random variables, 
and suppose that they are independently distributed. The random variables U = 
X + Y and V = X/Y are independently distributed if and only if both X and Y 
have gamma distributions with the same scale parameter. 

In the following it will be convenient to introduce the random variable 


(1.3) W = 1/1 + V) = Y/(X + Y). 


The random variables U and W are both nonnegative. Moreover, W is a bounded 
random variable and 


(1.4) 0sW sl. 


2. Analytic properties of the characteristic functions of X, Y, U, and W. 
In this section we consider the characteristic functions of these random variables 
and investigate in particular whether the integrals defining these functions exist 

Received July 2, 1954. 
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also for some complex values of the argument. In addition, we discuss some 
properties of these functions which we need in the subsequent sections. 

We denote in the following by F(x), G(y), Hi(u), and H.(w) the distribution 
functions of the random variables X, Y, U, and W respectively. Since X and Y 
are nonnegative, the characteristic functions of these random variables are 


(2.1) fle) = [ are), g() = [ e aaw). 


Due to the nonnegativity of X and Y, the range of integration extends only 
from 0 to ~. Therefore these integrals exist not only for real v but also for com- 
plex values v = s + it, where s and ¢ are real, for which ¢ = Im(v) 2 0. It 
follows then from well-known properties of the Laplace integral that the func- 
tions (2.1) are analytic for ¢ = Im(v) > 0 and that 


f'Wv) =i [ ze'* dF (zx), f’(v) — [ ze’* dF(z), 
(2.2) : " 


go) =i] ye aGw, 9") =- | se aay). 
0 0 


We need also the following continuity property of the integrals (2.1), 
(2.3) lim f(s + it) = f(s); lim g(s + it) = g(s) 
ty ty 


where ¢ | 0 means that ¢ approaches zero from above. 
We consider next E{e'*”}, the characteristic function of the random variable 
W. It is seen from (1.4) that 


1 


+0 ‘ 
E{e*™} -/ e” dH-(w) = | e” dH2(w). 
© 0 


This integral exists for all values of z and is an entire function of z. The random 
variable U is nonnegative. Its characteristic function is 


- 


Efe") = f. e" dH,(u) = | e’™ dH,(u). 


Therefore this integral exists for all values of v for which ¢ = Im(v) 2 0 and is 
analytic for all v for which ¢ > 0. Similarily the expectation E{e””’***”} exists 
for all z and all v such that ¢ = Im(v) = 0 and is analytic for all z and all v with 
t = Im(v) > 0. 

We conclude this section with another remark concerning W. The random 
variable W is bounded; therefore all its moments exist. We denote in the fol- 
lowing by 


Y ee 


It is then easy to see that 
(2.5) 0<@ S56 <1. 
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3. The independence of U and V. The random variables U and V ave assumed 
to be independent. It follows then from (1.3) that this is equivalent to the in- 
dependence of U and W. We denote by H(u, w) the joint distribution of the 
random variables U and W and have then 


(3.1) H(u, w) = H,(u) H2(w). 
We conclude from (3.1) that Efexp (wU + izW)] = Efexp (ivU)] Efexp (izW)], 


or 


E{ exp | w(X + Y) + ety]! 
a 7 
(3.2) | ¥ 


; 1zY 
= E {exp [iv(X + Y)]} B{ exp FSi 
We have shown in the preceding section that the expectations in (3.2) are 
analytic for all z and for all v such that ¢ = Im(v) > 0. We now write (3.2) ina 
more explicit form and obtain 


[[ exp] ioe +) + | arte) caw) 


(3,3) = 


= [f exp [iv(z + y)] dF(z) dG(y) [fp exp | = Jere) dG(y). 


tty 


The integrals in (3.3) converge for all complex z and all v = s + it such that 
t = 0. Therefore they are analytic in z and also in v, provided thatt = Im(v) > 0. 

Equation (3.3) is the starting point for our investigation. We shall derive from 
it two relations for the unknown characteristic functions. 


4. The relation between the characteristic functions. We assume for the time 
being that t > 0. Therefore we can differentiate (3.3), first with respect to v and 
then with respect to z. In this manner we obtain 


feu | arte) aa) 


[[ vee +0 +2 


(41) - | | ” (x + y) exp lio(z + y)] dF(2) dG(y) 


; 7. izy 
eee 
If we put here z = 0 and use the notation (2.4), we see that, if ¢ = Im(v) > 0, 


[ | , y exp [iv(z + y)] dF(x) dG(y) 
"See 


= 4 | | ” (@ + y) exp liv + y)] dF (x) dG(y). 
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We use (2.1) to express the integrals in (4.2) in terms of the functions f(v) 
and g(v) and obtain 


(4.3) (1 — 4) g’(v) fv) = &f"(v) g(r), t = Im(v) > 0. 
From (2.3) and from 


(4.4) f) = 9) = 1 
we conclude that there exists a neighborhood of the origin such that 
(4.5) fv) ¥ 0, g(v) # 0, 


for all values of v belonging to this neighborhood for which ¢t = Im(v) 2 0. 
(This neighborhood could, of course, be the half-plane t = 0.) Moreover, it 
follows from (2.1) that f(t) and g(it) are positive for real ¢ > 0. From the con- 
tinuity of f(v) and g(v) we see that every point of the segment 0 < Im(v) < 1 
has a neighborhood in which (4.5) holds. We conclude then from the Heine- 
Borel covering theorem that there exists a simply connected domain 4%) contain- 
ing the interval 0 < Im(v) S 1 in which f(v) and g(v) do not vanish. In the 
following we restrict ourselves to this domain. We may then divide (4.3) by 
f(v)g(v) and obtain 


(4.6) (1 — 6:)(g’(v)/g(v)) = &(f"(v)/f)). 


In the domain D we may introduce the logarithms of f(v) and g(v) and integrate 
(4.6) using the initial conditions (4.4). We obtain finally 


(4.7) (g(v)}'** = [fry]. 


This relation is certainly valid for values of v for which the relations (4.5) hold 
together with 


(4.8) t=Imv>0. 


5. The differential equation. We derived relation (4.7) which connects the 
two unknown functions f(v) and g(v). To determine these function we must 
find a second relation. This may be accomplished by repeating the procedure 
which led from (4.1) to (4.6). We still restrict our considerations to values of v 
from 9) for which (4.5) and (4.8) hold. We may therefore differentiate (4.1) 
twice, first with respect to v then with respect to z. We put finally z = 0 and see, 
using again the notations of (2.4), that 


l | : y’ exp [i(z + y)v] dF (x) dG(y) 


= 0, Ly: (a + 2ry + y’) exp [iv(x + y)] dF(x) dG(y). 
0 ~0 


If we substitute for these integrals the expressions (2.1) and (2.2), we obtain a 
second relation between the functions f(v) and g(v), 


g” (v) fv) = OLf"(v) g(v) + 2f’(v) g’(v) + 9”) f)). 
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On account of (4.5) we may write this in the form 


g” (v) f(r) f'(v) g'(v) , g’(v) 
(5.1) a. [ao + 2G oh + OO]. 


It is now convenient to introduce the logarithms 


(5.2) o(v) = log f(v), y(v) = log g(r). 
Then 
f'(v) 


g (v) 
Flo = ¢(v), ato) = y'(v) 
f"(v) 


FO -vm+twor, Co -v0 +Woor. 


(5.3) 


From (4.6) and (5.2) we see that 
(5.4) (1 — 6:)¥’(v) = 4 ¢'(v), (1 — &:)¥”(v) = 6,¢7(2). 


After some elementary computations we obtain from (5.1), (5.3), and (5.4) the 
differential equation 


(5.5) (1 — 1) (0: — 6) ¢”(v) = (62 — 61) [o’(v) J. 


6. Solution of the differential equation. We first leave aside the cases where 
6; = 6, or 6 = 6; and consider the case where 0 < 6; < 62 < @ < 1. Then 


(6.1) a GMA) > 9 


and we may write (5.5) as 

(6.2) $”(v)/lo'(v) J? = 1/p 

for values of v satisfying (4.5) and (4.8). We denote by 

(6.3) ky = E(e*), ke = E(Xe*), a = (kip — ke)/ke 

and integrate (6.2) with the initial condition ¢’(i) = tke/k,. We obtain easily 


1 ban § kip —ka_ ev a 


¢(v) ~~ ip ipky pip 


so that ¢’(v) = (ip/a)/(1 — iv/a). We integrate this again and obtain, consid- 
ering (5.2), 


(6.4) f(ve) = (1 — w/a)” 


where c, is a constant of integration. It follows from (2.3) that lim,jof(i#) = 
f(0) = 1, therefore c. = 1. It is then seen from (6.4) and (4.7) that 


(6.5) fe) = (1 — w/a)”, = g(v) = (1 — v/a)”. 


The equations (6.5) were derived for all points v of the domain D. This restriction 
on v can now be removed. The functions f(v) = ff e’ dF(x) and (1 — iv/a)” 
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are both analytic in the upper half-plane ¢ = Im (v) > 0, and agree in the do- 
main D. Hence they agree in the half-plane Im (v) > 0. The same argument ap- 
plies to g(v). Finally it follows from (2.3) that (6.5) holds also for real values of v. 
The characteristic functions of X and Y are therefore given by (6.5) so that X 
and Y have, indeed, gamma distributions with the same scale parameter. We 
still have to consider the cases 6; = 6. and 6; = 62. If 0, = 6, then we see from 
(2.5) that 6. > 6{, therefore (5.5) reduces to ¢’(v) = 0. If on the other hand 
6; = 0, then we obtain from (5.5) the equation ¢”(v) = 0. By a reasoning 
similar to the one we employed earlier, it is seen that in both these cases the 
random variables X and Y have degenerate distributions. 

Therefore, we have established that the independence of U and V implies that 
X and Y have gamma distributions with the same scale parameter. We still 
have to prove the converse. This does not follow from Pitman’s result [2] quoted 
in the introduction, since we did not assume that X and Y have the same dis- 
tribution. 

Suppose, therefore, that X and Y are independent random variables and that 
their distributions are F(z; a, \1) and F(z; a, d2), respectively. We see, then, 
from (1.1) that the characteristic function of the joint distribution of U and V 
is given by 


E{ exp sax +Y)+ mh 


= TOTO) LI exp | —(a — it)(@ + y) + |. MAME! de dy. 


If we introduce new variables under the integral sign by setting 
= (a — t)z, n = (a — tt)y, 


we see easily that the integral can be written as a product of two functions, one 
depending on ¢, the other on z. Therefore, the random variables U = X + Y 
and V = X/Y are independent and the theorem is fully established. 

Our theorem is closely related to a recent result of R. G. Laha [1], who charac- 
terized the gamma distribution essentially by the independence of the ratio of 
certain quadratic forms from the mean. Laha assumed that the random vari- 
ables are identically distributed and that their second moment exists; both these 
assumptions were avoided in the present paper. 
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THE RATIO OF VARIANCES IN A VARIANCE COMPONENTS MODEL’ 
By W. A. THompson, Jr. 
Virginia Polytechnic Institute and University of North Carolina 


Summary. Our discussion will concern primarily \, the ratio of two variances 
which arise in discussing the “‘mixed”’ incomplete block model. In Section 1 we 
find first the class of invariant statistics for a test involving this ratio and second 
the joint distribution of these statistics. In Section 2 we use these statistics to 
construct a test (with certain optimum properties) of the hypothesis \ < Ao 
versus \ > Aj. 


1. The general incomplete block variance components model. Suppose that 
yij fort = 1,-+-,u, andj = 1,--- , b are independent and normal for given 
ti, --+, t, with means E(y,; | t) = ni;(t; + b;) and variance o’. Here n,; is 1 or 0 
according as the ith treatment does or does not occur in the jth block. The 
total number of observations is N, that is, ) :.;i; = N. In addition suppose 
that the ?’s are independent and identically normal with mean 0 and variance 
é’. If t were an unknown parameter instead of a random variable, we would have 
the general incomplete block model which appears in analysis of variance (see, 
for example, Bose [1]}). 

In the general theory of incomplete block designs we make use of the block 
totals B,, --- , By and of the “adjusted yields” Q,, --- , Q. . It is known from 
this theory that the latter form the basis of a vector space V, of dimensionality 
b, while the former generate a vector space Vg of dimensionality r, say. (In the 
case of a connected design, r = u — 1.) Further Vz and Vo are orthogonal to 
each other and to the error space V’ of dimensionality N — b — r. We may 
now choose an orthogonal basis for V’, say Y,,--- , Ywo+. 

Again from incomplete block design theory we know that 


E(B; \t) = kjbj + mij + rej + +++ + tue, 
E(Q; | 0) = cat: + Cote + +++ + Conte, 
E(Y;|t) = 0, 
and also that the covariance matrix of the B’s and Q’s and Y’s for fixed ¢ is 
ky 0: 


i 
\ 
| 
| 
“| 
1 
1 
i 
1 
| 
! 
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where k; = > 1 ;;, and the C matrix, involved in both the expectations and 
variances of Q, is again from incomplete block design theory. 

Now we state several lemmas which were independently developed by Ma- 
dow [5] and Skibinsky [6] and which we will find useful. 


Lemma 1. E[E(X | Z)] = E(X). 
Lemma 2. Var(X) = E[Var(X | Z)] + Var[E(X | Z)]. 
Lemma 3. Cov(X, Y) = EfCov(X, Y | Z)] + Cov[E(X | Z), E(¥ | Z)]. 


Applying these lemmas, we find the unconditional means and the covariance 
matrix to be 


E(B;) = k3b;, E(Q;) = E(Y,) = 0, 
Cov(B, Q, Y) - 


where the partitions with the stars in them are known though perhaps com- 
plicated constants. 


Now in order to simplify the problem as far as possible we will make the 


transformation Q,, --- , Qu ~ Z1, °°: , Zu: Q = MZ where M is an orthog- 
onal matrix such that 


e1 0! 

le 

0 Cr} 

M’'CM =| ------ etaivenbioaemnte 
1 

<4 

I 


a diagonal matrix with the characteristic roots of C in the diagonal. Note also 
that 


M’C°M = M’C(MM')CM = (M’CM) (M’CM) 
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Thus Z,,---, Z, have the covariance matrix D,o’ + D, ¢ while Z,+:, - 
Z,, are zero with probability one. 

Because of the orthogonality of the Z’s among themselves and the mutual 
orthogonality of Vs, Ve, and V’, the Y’s, B’s, and Z,,--- , Z, are N linearly 
independent linear functions in the space of the y’s and thus form a basis for the 
y space. We may therefore make a transformation of the y’s into the Y’s, B’s, 
and Z,,--- , Z,. These last variables are of course multivariate normal. From 
this and the nature of the covariance matrix of these final variates we see that 
B,,---,By,Z:,-:-, Z, and >> Y% are a set of sufficient statistics for the dis- 
tribution. 

Suppose now that we are interested in placing confidence limits on, or testing 
hypothesis concerning, the ratio <¢/o* = \, say. Let us consider a group G of 
transformations on our set of sufficient statistics. Let G be 


By =cByt+ke;, A=ch,--+,Z=h, (LY) =c(LY). 


a 


Since the effect of G is only to change the mean of B; and multiply the co- 
variance matrix of (B, Z, Y) by an arbitrary constant, c’, the problem is in- 
variant under G. In this connection, see Lehmann [4]. A maximal invariant 
under G is 


A/VEV,  B/VEY. 
Thus G induces the group of transformations G, 
b; = c(b, + ¢,), o =Co’, ef =c é, 


a maximal invariant for which is ¢/o” = \. Thus if we adhere to the principle of 
invariance, then in making inferences about \, we may restrict ourselves to func- 
tions of 


ZAlV> Yi, ++) Z2/V> Yi; 
We now find the joint distribution of the statistics 
= Z,/WJ/eXers+i; — / Xe = Z| VerXesis 
where X,41 = aa Yiandn = N—b—r. 


Let W; = Z; / Ve; ; then W; is N(O, o* + e; &) and since the W’s and X,4; 
are independent, their joint frequency function is 


1 w, x 
t n/2—1 | -} (+255 i #)) : 
const(2z,1) exp soa ;+ 3 


Making the transformation 


Xi = Wil VXe41 nn fas X, = W./VXen; 
we find that the probability element of X,, X.,--- , X-, 20 is 


(n+r)/2—1 Tr+t 
const(z,+1) exp | - 3 (1 + ear 1 z ei a) | 
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We may now integrate out over X,,; , noting that we have a gamma function 
in this variable. We find the probability ee of X,,---, X, to be 


tli (mtr) /2 
— ( * Lia + eid al 


2. A one-sided test on \. Let wo: A S Ay and wi: A = Ax where Ay < Aj. 
We now interest ourselves in tests of Hy: \€wo versus H;: XX €w;. The re- 
gion between Xo and ), is a zone of indifference to be determined by the experi- 


mental situation. That is, if \ < »’ < A1, then we do not particularly care 
whether we accept Ho or H,. 


Now consider an a priori distribution defined on w) which assigns probability 


1 to \ = Xo, and similarly a distribution on w, which assigns probability 1 to 
1A = At ° 

According to a theorem of Lehmann [3], if now we construct a most powerful 
size a test of \» versus \; which has power 6 and if we can show that this test has 
size a for the composite hypothesis and power = 8 for all A in a , then this test 
is the one which maximizes the minimum power. 


We may use the Neyman-Pearson Lemma to construct a most powerful test 
of X = Xo versus A = i, . If we let 


~14+ D0 Xi/(+enr) _ CYi + DO Zi / (es + ei ds) 
1+ > Xi /Q+erm DLYi+ DZ / le +e)’ 
then the above test becomes 
if R > c, accept hypothesis \ = Ai, 
if R < c, accept hypothesis \ = do. 


Here c is a constant chosen so that the test has significance level a. 
The power function of this test is 


Lyi zi )| 
B(A) = const i exp E (4# + >, a+ ee dy dz 


const f lorse | -3 (UK+L ‘) | df dg, 
where we have made the transformation 
F; = Y;/e, 
OF Z;/(e@ + ee)", 
DoF + DG + esd) / (1 + esdo) 


DF + LG + esd) / (A + esd) 
We may compute, by straightforward pres ee algebra, that 


a (Gr ae a + er) / a = 4 
sr + DGi(1 + e »/ / (1 + e&))? 


R'(A) = 
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which is greater than 0 except when 


Fi, =F, = ooo = Fi & G| = G, = +--+ =G, = 0. 


that c < R, so that Jri>c < Jri>e- Therefore 8 is an increasing function of 
R’ and thus of A. Thus 


Thus R’ is an increasing function of \. Also if R; < R2, thence < Rj implies 


B(A) S Bide) = efor all A ew, B(\1) S B(A) for all AX ew. 


Accordingly we have proved the 
TueEoreM. The test, accept or reject Hy according as R < c or R > ¢, is the one 
which maximizes the minimum power among all invariant tests. 
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ON MOMENTS OF ORDER STATISTICS FROM THE WEIBULL 
DISTRIBUTION 


By Junius LIeBLeIN 


National Bureau of Standards 


Summary. This note expresses the first two moments of the order statistics in 
samples from the Weibull distribution (sometimes referred to as the “third” 
asymptotic distribution of extreme values) in terms of known (incomplete B 
and T°) functions. A similar procedure is applied to the “‘second” asymptotic dis- 
tribution of extreme values. 


1. Introduction. Explicit formulas in terms of a certain tabulated function 
were derived in an earlier paper [1] for the moments of order statistics for the 
“first” asymptotic distribution of largest values, with cdf 


(1.1) F(x) = exp (—e”), y=(z—u)/B, —-2x7 <2a< om. 


This note extends the procedure to the other two asymptotic forms. However, 
the distributions for smallest rather than largest values are considered, with a 
view to possible application to breaking strength and fatigue problems. Without 
loss in generality for our purposes, the two other distributions may be taken 
(see [3]) as having cdf’s 


[2 — exp [— (—z)"], zs 0, 


(1.2) G(x) = 


"9 x > 


0, x , 


(1.3) H(z) = 

\l — exp (—2z”), > 0. 
The parameter m is positive. These distributions are designated, respectively, 
the second and third asymptotic types (for smallest values). The distribution 
(1.3) has been applied extensively, mainly by Weibull [5], [6], [7], [8] and will 
be referred to by his name. 


2. General approach. The main point of difficulty is in evaluating the double 
integrals that occur in the covariances, the means presenting little difficulty. 
It is interesting to proceed generally and try to discover other distributions for 
which the technique introduced in [1] will work. 

If P(x) is the cdf of the parent population, and p(x) is the corresponding df, 
then the joint df of the ith and jth order statistics from P(z) is [9] 


(2.1) plz, y) = KIP(@))"IPy) — P@)P "Lh — Pw)" pp), 
—-eo<ztsy<o, 
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where K = K(n, i,j) = n!/[(@@ — 1)! GG — i — 1)! (n — j)!] and z = 2; and 
y = x;, witht <j. 

If in (2.1) we write P(y) — P(x) = [1 — P(x)] — [1 — P(y)], the covariances 
may be expressed in terms of integrals of the forms 


[ [ ry [P(x)]}‘ [1 ane P(y))" p(x) ply) dx dy, 
(2.2) be. 


{. [. xy {1 — P(x)‘ [P(y)]* pz) ply) dx dy. 


Leaving aside simple cases such as the rectangular distribution P(x) = z, 
parabolic distributions P(x) = 2‘, and perhaps other simple forms for which 
the evaluation procedure in [1] is unnecessary, it will be seen on trial that suc- 
cess in using this procedure depends upon having P(x) or 1 — P(zx) of the form 
e* where Q is a reasonably simple function. Since P(x) and 1 — P(z) are non- 
negative, this is always possible with real Q(z). 

The integral (2.2) is then a (finite) linear combination of integrals of the form 


v(t, u) as f: [ agg Pte Q’ (x) Q'(y) dx dy. 


Here ¢ and u are not necessarily the same as in (2.2). 


To determine Q, we carry out the procedure as far as possible. The first step, 
integration by parts, gives 


V(t, u) ine rl ye™ Qty) Q’(y) dy aa, | [ ree Q(y) dz dy. 


Calling the double integral y(t, u), the second step, differentiating with respect 
to t, gives 


ons u) ~ Li {. y Q(z) Q’(y) eit +uew) dz dy. 


This can be evaluated in two cases, (i) when the inner integral can be evaluated 
as it stands, and (ii) when the double integral is, apart from a constant factor, 
precisely of the form y(t, wu). 

In case (i), we need to have Q(z) = cQ’(x), whence Q(x) = ae™, which makes 
either P(x) or 1 — P(x) doubly exponential. That is, we have the first asymp- 
totic form for smallest values, 1 — exp (—e*). The simple transformation z = 
—z’ converts this into the distribution of largest values, P(x) = exp (—e *), 
previously considered [1]. Thus this case has, essentially, already been treated. 

Case (ii) requires that 2Q’(rz) = cQ(zx), so that Q(z) = az’ (which, of course, 
includes the exponential distribution). This gives, essentially, just the two 
forms (1.2) and (1.3). 

It is rather interesting that the three asymptotic distributions of extreme 
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values thus seemingly exhaust the possibilities for the method introduced in con- 
nection with one of them. 


3. Results. The Weibull distribution (1.3) is of chief interest. The method in 
case (ii) applied to (2.1) and (2.2) for this distribution yields a simple first order 
differential equation in y(t, uv) whose solution is given by 


t 
vem y(t, u) — wt Y(u, u) = ~ r(2 + 2/m) / “nt + uy dt. 


The function ¥(u, u) is readily evaluated and the integral may be expressed in 
terms of the B-function by the change of variable ¢ = uw / (1 — w). The final 
expression for y(t, u) is 


(3.1) v(t, u) = m “(tu)” T(2r) B,(r, r), 
r=1+1/m; p=t/(t+u); mtu> 0. 


For given n, the values of y are needed only for ¢ + u S n. To cut the calcula- 
tions in half, we can use the relation 


v(t, u) + o(u, t) = m*(tu)” T(2r) [B,(r, r) + Bi_p(r, r)] 
m~*(tu)~T*(r). 


The above values enable us to write down the results for the Weibull dis- 
tribution (1.3) as 


ky n! k ~ _ 1-1 
Ba) = yt ait (1 +E) Ere 
‘(n-u—i+1e, 


m>0O; +=1,2,---,n; k=0,1,2,---; 
(3.2) 


i—1 j—i-l | 
E(xix;) = mK YD (-1)"Per'cr WG - itu», 
u=0 


y=0 


n—-jt+vtl), 
m>0O; i<j; 443 =1,2,-°-,n. 


Here K is as in (2.1), and the function y is given by (3.1). 
For the second asymptotic distribution (1.2), the results are similar and may 
be obtained with little difficulty. The function corresponding to y is 


vr, u) = m (tu) (P(") — TQr')B,(r’, r)), 1 = 1 — 1/m, 


p=it/t+u), m>2, tu>d. 
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The resulting moments are then 
ky n! _k wk pyi—l 
Bead = apt miT (1~ g) Bes 


‘(nt+yu—-i+17°™ 


m>0O; +=1,2,---,n k<m; k=0,1,2,-:-:-; 
(3.4) 


i—1 j—i-l 


Blewn) = mK ED (-1 ortor 


p= = p=) 
vn—-jt+vt+il, j-itu-—»), 
m>2; i<j; i,j, = 1,2,---,n. 
Here K is as in (2.1), and y¥* is given by (3.3). 


4. Remarks on application. The present results can be used in cases where m 
is known, perhaps from previous work. One important application of moments of 
order statistics is in finding minimum-variance unbiased estimators by means of 
linear functions of such statistics. An illustration of this for the case of the first 
asymptotic distribution will be found in [2], Appendix C. If m is not known, then 
there is available [4] a simple graphical procedure for obtaining an estimate 
which could then be used in the above formulas for first approximations. 
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ON THE ASYMPTOTIC NORMALITY OF SOME STATISTICS USED IN 
NON-PARAMETRIC TESTS 


By Meyer Dwass 
Northwestern University 


1. Summary. Let (R: , --- , R,) be a random vector which takes on each of 
the N! permutations of (1, 2, ---, N) with equal probability, 1/N!. Let 
(ay1, *** ,@ww) and (by1, --+ , byw) be two sets of real numbers given for every 
N. We will assume throughout that for no N are the ay; all equal or the by; all 
equal. We also assume that the ay; and by; have been so normalized that 


> avi = > bw: = 0; > ani = n*>;> by; = |. 
Unless otherwise stated, >> will mean }-%_,. Define 
(1.1) Sy = > ani bwr,; ° 


Let (xz) be the unit normal c.d.f. In Section 2 sufficient conditions are given for 
Pr{Sy < x} to approach ®(z) as N — o. (The first two moments of Sy are 0 
and N/(N — 1), respectively.) 

For every N, let Y = (Yu,---, Yin,,-+*+, Ymi,°**, Ymv,) be N = Ny 
+ --- + N,, random variables which are mutually independent and independ- 
ent of the R;. We assume that all Y;; with the same first subscript are identically 
distributed. Define 


P=2N*D.;%;, S=N*d.5(¥a—-P, sY%i; = Ys — Ff, 
Y:;; = Oif S = 0. 


Let Y’ denote the vector of the Y;;. Let y = (y1, °°: , yw) denote a point in 
N-space. By Fy(z, y) we mean thec.d.f. of therandom variable Sy = >> ayiyve; - 
In Section 3 are considered sufficient conditions for convergence with prob- 
ability one of the random c.d.f. Fy(z, Y’) to ®(z). 


2. Asymptotic distribution of Sy when the by; are nonrandom. Let Gy(z) 
denote the c.d.f. of the by; (continuous to the left). We assume that the by; 
have been so indexed that by: S bye S --- S byw. 

THEOREM 2.1. Suppose 

A) there is a c.d.f. G(x) such that limy.. G(x) = G(x) at every point of continu- 
ity of G(x) and 


[ xz dG(x) = 0, [. x dG(xz) = 1; 


and either 
B) lim max |ayi|}=0 or B’) G(x) = ®z). 


No 1SigNn 
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Then Sy (1.1) is asymptotically normally distributed. That is, Pr{Sy < x} —- 
&(x) as N — ~. 

This theorem can be compared with that of a previous paper [1]. There it was 
assumed that the by; were essentially powers of expected values of order statis- 
tics. In this paper we use the fact that the by; behave asymptotically like ex- 
pected values of order statistics. The idea of the earlier theorem [1] is then used 
in proving this theorem. In [1] the counterpart of G(x) was assumed continuous; 
in this paper we make no such assumption. Theorem 3.1, below, deals with the 
case where the by; are random variables. From now on, the function G(x) will 
be assumed to have the properties stated in Theorem 2.1. 

The proof of this theorem is aided by Lemmas 2.1 to 2.7 below. The main 
lines of the proof are as follows. Let X,, --- , Xw be independent, identically 
distributed random variables, each with the c.d.f. G(x); let Zn. S --- S Zww 
be the ordered values of the X’s. We will define below a vector function of the 
X’s, (Ri, --- , Rw), which has the property that it assumes each of the permuta- 
tions of (1, --- , N) with equal probability, 1/N!. Then >> ay; bye; = Sy is 
the random variable whose asymptotic normality we seek to prove. 

Either condition B) or B’) will assure us of the asymptotic normality of 
7 ayiX;. (See [1]; however, existence of third moment is not required. This 
follows from the counterpart of Lindeberg’s condition for a sequence of se- 
quences of random variables. I am obliged to Prof. W. Hoeffding for pointing 
this out to me.) 

Hence it is sufficient to show that >> ay;X; — Sy converges in probability 
to zero. Sufficient for this is to show that limy.. EO. ayiX: — Sy)* = 0. We 
have that 


(2.1) E> ayiX; — Sy)’ = EX{ — 2E(> aw:X;) (Sv) + N/(N — 1). 


The purpose of Lemma 2.1 is to show that our particular definition of 
(Ri, --- , Rw) provides us with the fact that ES GyiX;:) (Sy) = (N — 1)” 
‘>, EZy, by; . Since EX} = 1,then if limy..(N — 1) >> EZyibyi = 1, this will 
imply that (2.1) approaches zero as N — o. Making use of the fact [3] that 
limy.« N~’ >°(EZy;)* = 1, it is easy to see that 

lim N™ OEZyibwi = 1 —isequivalentto lim N~’ )o(by; — EZy,)’ = 0. 
N-0 N20 

The truth of this last limit is shown in the sequence of Lemmas 2.2 through 
2.7, where important use is made of some results of Hoeffding [3]. 


A random vector R = (R,,---, Rwy) is defined by considering the ordered 
arrangement of the X’s, 


(2.2) . aE as eee, 


If no two of the X’s are equal to each other, then let R;, = 1, 
Riy = N. If there are ties among the X’s then let 


(X;, Ss Xi, a-°°3 Xiy); eS one (Xj, s X i = 
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be the set of all possible distinct ordered arrangements of the X’s. Say there are 
p such arrangements. In such a case, let 


PU(Ri, > 1), (Ri, - 2), a (Riy ” N) | (Xi, _ a Xy)} _ 1/p, 


P(R;, - 1), (Rj, = 2), is (Riy > N) | (Xi, ---, Xy)} = 1/p. 
It is easy to see that R equals each of the N! permutations of (1, --- , N) with 
probability 1/N! 
Lema 2.1. E(D> awiX;) ( awibwe,) = (N — 10° D> EZyibyi. 


Proor. The left side can be written as 
(N1)7 >’ > avi byr.EQ, ayiX;| (Ri = 11), --- , (Rw = rw)} 


where (r:, --- , rx) is one of the permutations of (1, --- , N) and }-’ is sum- 
mation over all N! permutations. We also have that 


E(X, | (Ri * ri); "Tres (Ry = ry)] - E|Zy-; | (Ry = r), Ws: +% (Ry - ry)). 


Since the distribution of X,, --- , Xy is invariant under all permntations, this 
last conditional expectation does not depend on the values of the R’s and hence 
equals EZy,;. This shows that the left side equals 


(N17 3’ (& awibyr,) ( aviEZx,,). 


By an elementary calculation, this in turn is equal to the right side. 

Since Zu. S Zy2 S +--+ S Zyw, then EZm, S EZy2 S --- S EZyy . Denote 
by Hy(x) the c.d.f. of the EZy; (continuous to the left). 

Lemma 2.2 (Hoeffding). limy.. Hw(z) = G(x) for every x which is a point of 
continuity of G(x). 

This is proved by Hoeffding ([3], Theorems 1 and 2). 

Let S be that set of points on the real line where G(x) is discontinuous, to- 
gether with all points x where G(x — h) < G(x) < G(x + h) for every h > 0. 

LEMMA 2.3. Given « > 0 and & > 0, there is an even integer 2n and there is a 
sequence of closed intervals which are mutually disjoint, 


[tr ’ te), [ts ? ta], Soe oe [ten—1 ’ ton), 
having the following properties. 
a) With each t; we can associate an a; with O < a; < 1 anday; € S such that 
t; = a, Gy; — 0) + (1 — a;) Gly; + 0); 


b) yilte — &) + yale — fe) + °° + yintllen — tra) > 1L- 4; 

c) max [(y2 — y1), (Ys — Ys), *** » (Yan — Yana)] < ee. 

Proor. We will denote the right side of the equality in part a) by G,,(y,). 
Since fiy’ dG(y) exists in Riemann-Stieltjes sense for any A, B, we can find 
finite A’, B’ and a partitioning, 


Ab=y,<y < + Syn = B, 
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such that 
yr IG(y2) — G(ys)] + y2'IG@(ys) — G(y2)] + -- 
+ ym-1G(Ym) — G(ym—s)] > 1 — da, 
max (yisi — yi) < fe. 
is igm—1 


We can replace this partitioning by one 
A= yn < yo <-*** < Yon = B, 
and find constants 0 < a; < 1 where each y; ¢ S and for which 
YilG@o,(y2) — Ga,(yr)] + y2lGa.(ys) — Gas (ys)] + +> 
+ Yan—1Gas,(Yor) — Gar,—,(Yon—1)] > 1 — a2, 


and where c) is satisfied. The clumsy but elementary details are omitted. We 
need now only set ¢; = G,,(y;) for = 1, 2, +--+ , 2m; adjacent ¢t,;’s can be equal 
to each other. 

Lemma 2.4. For every N, let there be given an m = m(N). Let y ¢ S. If m/N — 
aG(y — 0) + (1 — a) Gly + 0), for some a where 0 < a < 1, as N — ~, then 
bym — y and EZym — y. 

Proor. Suppose limy..«bym = y + 6, where 6 > 0. Then there are 6’, where 


0 < & < 4, such that y + & is a continuity point of G, and a subsequence N, , 
m;, such that 


lim m:/N; = Gly + 4’) > aG(y — 0) + (1 — a) Gy + 0), 


which gives a contradiction. We treat lim by» similarly. Making use of Lemma 
2.2, we can prove this for the EZy; in the same way. This was proved by Hoeff- 
ding ((3], Lemma 5) without using the result quoted in our Lemma 2.2. 
Let T = ft, tulle , ju --~- ulten_s, ten], a finite union of disjoint closed inter- 
vals. Let > a; mean summation over all indices i such that (t/N) e T. 
Lemma 2.5 For every « > 0, there exists a finite union of disjoint closed intervals, 
[t; ’ t.Ju oR U[len—1 ’ lon| = 7 such that 


lm N20 by; >1—«¢ limN D>) (EZy)* >1—«. 
T T 


Proor. Choose « > 0. We can find a sequence of closed intervals which are 
mutually disjoint, [t: , t2), [ts , ta], --+ , [lena , teal, satisfying the requirements of 


Lemma 2.3 for « = 4; the value of « of Lemma 2.3 is not material here. 
Then 


wre’. We 
e 


aoN* | +n 2 Qh. 47" > a... 


NtiisigNts NissigNt, Nton-1BStQNtan 
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For simplicity, look at the first term on the right side of the above equality. 
Let p denote the number of integers contained between Né, and Nt, inclusive. 
Let [r] denote the least integer greater than or equal to r. Then 
N* Do i. = bywey) Np. 
NtiSigNte 
By Lemma 2.4 the first factor of the right side has the limit yj. The second 
factor has the limit ¢. — 4. Hence the limit inferior of the left side is greater 
than yj(te — t) — 4¢/n. Therefore 
lim N™* Do bis > Doysi—s (bey — ty-1) — Fe > 1 —e. 
N02 T 
The last inequality of Lemma 2.5 is proved in the same way. 
Lemma 2.6. Given « > 0, we can find a set T = T(e), such that Lemma 2.5 
holds, and also limy+2. N~*>>r (bwi — EZyi)® < «. 
We omit the proof, since it is very similar to that of Lemma 2.5. It uses Lemmas 
2.2-2.4; in particular, Lemma 2.3 is essential. 
Lema 2.7. limy.. ND, (by: — EZyi)* = 0. 
Proor. For « > 0, find 7 = T(e) satisfying Lemma 2.6. Write 


N >\(bw; — EZwi)® = N >> (bus — EZwi)® + ND’ (bni — EZy:)*, 
T 


where >.’ means summation over those indices not summed in }r . By Lemma 
2.6, for all N sufficiently large, N~'>>7 (by: — EZyi)’ < 2e. It is true that 


ND” (bwi — EZwi)® S ON DB, + VN (EZ - 


For all N sufficiently large, each of the terms under the square root sign is, by 
Lemma 2.5, less than 2e. Thus for all N sufficiently large, N~’ > (bw; — EZy:)’ 
is arbitrarily small, which proves the lemma. 


3. Asymptotic distribution of Sy when the by; are random variables. We will 
now deal with the generalization of Theorem 2.1 that was described in the last 
paragraph of Section 1. We refer to the definitions made there. Let Gy(z, Y) 
be the proportion of Y;; which are less than z, and let Gy(z, Y’) be the propor- 
tion of Y;; which are less than x. Let G,(x), --- , Gm(x) be the e.df.’s of Yu, 
Yu,-*: , Ym respectively. Let 


~ 


| x dGifx) = uw, | (x — wp)’ dGx) = oi, G(x) = > k:G(z), 


i=l] 


where k,, --- , k» are constants defined in Theorem 3.1 below, with k; 2 0, 
and > k; = j. 

TuHEoreM 3.1. Let G’(x) be the c.d.f. obtained from G(x) by a linear transforma- 
tion of the independent variable, so that the first two moments of G’(x) are 0 and 1. 
Suppose 

A). limy.. Ni/N = ky exists fori = 1,---,m; 

B) 0 <a; < ~; 
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C) condition B) or B’) of Theorem 2.1 is satisfied for G’ (x). 

Then Fy(x, Y'), as defined in Section 1, converges with probability one to (x). 

Proor. According to the Cantelli-Glivenko theorem, and some well-known 
facts about strong convergence, we have that Gy(t, Y) converges with probability 
one to G(¢), uniformly in ¢ ((2], pp. 257-279, 280). Also we have that, with prob- 
ability one, Y and S* converge respectively to the mean and variance of a ran- 
dom variable distributed with c.d.f. G(x). Then, with probability one, Gy(t, Y’) > 
G’(t) at the continuity points of G’(t), as N — «. Hence the required result fol- 
lows from Theorem 2.1. 


4. Applications. Some applications of a theorem like Theorem 2.1 have been 
pointed out previously [1]. We wish here to point out that Theorem 3.1 can be 
useful in evaluating the “large sample power’’ of tests of the sort that were stud- 
ied by Hoeffding [4], specifically his Theorem 6.2. Analogous results can be ob- 
tained where the X’s of that theorem are essentially like the Y’s of our Theorem 
3.1. Also, the results of this paper can be used to study analysis of variance tests 
like those of Hoeffding’s Section 5. We plan a future paper on the large sample 
power of analysis of variance tests of this type. 
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EMPIRICAL POWER FUNCTIONS FOR NONPARAMETRIC TWO- 
SAMPLE TESTS FOR SMALL SAMPLES! 


By D. TercHroEew 
University of California, Los Angeles 


A number of nonparametric or “distribution-free” tests have been proposed 
[2] for the problem of testing the hypothesis that two samples come from the 
same population. Some of these tests are functions of the “ranking” sequences 
which are obtained by arranging the observations from both populations from 
smallest to largest, and then replacing the observations from the first population 
by zero and the observations from the second population by one. 

If a sample of m is taken from the first population and a sample of n from the 
second, there are (m + n)!/m! n! different sequences. (Since we are dealing with 
continuous distributions, the probability of a tie is zero. The computer auto- 
matically carried about eight decimal places for each normal deviate and there- 
fore there was no need to investigate ties.) If the populations are identical, 
all sequences have equal probability of occurring. If the populations are not 
identical, some sequences have a higher probability than others. 

The hypothesis can be tested by the use of a critical region. The simplest 
test consists of selecting, in advance, k particular rankings to be in the critical 
region. If one of these rankings occurs, the hypothesis will be rejected. The size 
of the critical region, that is, the probability of rejecting the hypothesis when it 
is true, is given by 


m!n! 


. (m +n)!" 

Non-integer values of k can be obtained by the use of randomized tests. 

The problem of which rankings to place in the critical region is settled on the 
basis of power. Let P(R,) be the probability of a ranking under an alternative 
hypothesis. The power, for any alternative, is obtained by summing the P(R,) 
for all R; which belong to the critical region. Therefore, to construct an opti- 
mum test, one wants a test which, as a is increased, places the rankings into the 
critical region in the order of decreasing P(R,). 

Clearly, this test is the optimum test against the alternative, since it maxi- 
mizes the power. In general it has been possible to construct such optimum tests 
only against simple alternatives. It is of some interest to determine whether the 
construction yields optimum tests against classes of alternatives, that is, whether 
uniformly most powerful tests exist. 

Consider the special problem of testing the hypothesis that two samples come 
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TABLE II—continued 
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from the same normal population against the alternative that they come from 
normal populations with the same variance o”? but with means which differ by 
o6. The optimum test for this problem is, of course, the ¢ test. The optimum 
rank test for this problem, under the assumption that 6 is small, is the c; test 
which was given by Hoeffding [1] and studied in detail by Terry [4]. 

An interesting practical problem arises as to the behavior of the c,; test for 
large values of 6, for example, should one use the c, statistic in preference to 
the rank sum? The question may be phrased in the following way. If the rank- 
ings are ordered so that 


P(R:) = P(R2) = P(R;) = -:- 


is true for small values of 6, will this condition hold for all values of 6? 

Tables I and II give the empirical frequencies, in percent, of all possible rank- 
ings which are obtained when a sample of m from N(0, 1) and a sample of n 
from N(—6é, 1) are ranked in order of size and the individual values are replaced 
by 0 if they come from N(0, 1) and by 1 if they come from N(—4, 1). For the 
two extreme rankings, exact probabilities have been computed [3] and are given 
for comparison. The tables also give for each ranking the corresponding ¢; value. 

The frequencies were obtained as a by-product of a sampling experiment per- 
formed on the SWAC. The number of samples computed for the different values 
of 6 varied in the experiment. The values for n = 2 in Table I are based on 2000 
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samples for 6 = .25, .50, .75, 1.25, and 1.50, and on 1000 samples for 6 = 0, 
1.00, and 2.00. The values for n = 3 in Table I are based on 4000 samples for 
each 6 except 2.50, in which case 2000 samples were obtained. Table IT is based 
on 5000 samples for n = 2 and 7000 samples for n = 3, for all values of 6. 

The rankings in the tables have been arranged in the order of their c; values; 
in cases where the c, value is the same for two or more rankings, the ordering 
is by increasing probability. The tables show that the probabilities increase 
essentially monotonically. The deviation from monotonic increase can be ac- 
counted for by sampling fluctuation. These tables, therefore, indicate that, at 
least for some significance levels, it may be possible to construct uniformly 
most powerful rank order tests for the hypothesis. 

The tables may be used to estimate the power function of any rank order test 
for testing whether two samples from normal populations with the same vari- 
ance have different means for sample sizes (m,n) = (3,2), (3,3), (4,2), and (4,3). 
The tests may be randomized or unrandomized and one-sided or two-sided. 

I am indebted to I. R. Savage for suggesting the problem and for helpful dis- 
cussions about the results. 
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NOTES 


A NOTE ON THE THEORY OF UNBIASSED ESTIMATION 
By D. Basu 
University of California, Berkeley 
1. Summary. It is shown that even in very simple situations (like estimating 
the mean of a normal population) where a uniformly minimum variance un- 
biassed estimator of the unknown population characteristic is known to exist, 


no best (even locally) unbiassed estimator exists as soon as we alter slightly 
the definition of variance. 


2. Introduction. Let (9, @) be an arbitrary measurable space (the “sample 
space’’) and let {Ps}, 6 ¢ 2, be a family of probability measures on @. A real- 
valued function u = ye of @ is “estimable’”’ if it has an “unbiassed estimator’’. 
An unbiassed estimator of uv is a mapping 7» = 7, of the “sample space” X onto 
the space of all probability measures over the o-field of all the Borel sets on 
the real line such that 


i)T.= [ t dn, is an @-measurable function of z, 


(ii) we = | T. dP» for all 6€Q. 
= 


If, for every x € X, the whole probability mass of 7, is concentrated at one 
point, say 7’, , then 7», (or equivalently T,) is called a nonrandomized estimator. 
With reference to a given loss or weight function w(t, @), which is a Borel- 
measurable function of the real variable ¢ for every fixed @ ¢ Q, an unbiassed 
estimator 7 of us is better than an alternative unbiassed estimator 7, at the 
point 6 = 6 if 


[ aPy, [ w(t, 00) dn. < | aPs, [ w (t,0) dn.. 
¢ ‘te 9 2 


We consider only such estimators n, for which {*., w(t, @) dnz is an @-measurable 
function of z for all @ ¢ Q. 

Hodges and Lehmann [2] noted that if, for every 6 ¢ Q, the loss function 
w(t, 6) is a convex (downwards) function of the variable ¢, then the class of non- 
randomized estimators of uw is essentially complete. Barankin [1] and Stein [4] 
considered the particular case where w = |t — ye|* for s = 1 and proved, under 
a few regularity assumptions, that there always exists an unbiassed estimator 
which is locally the best at a given value of @ = 6. Simple examples may be 
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given to show that there need not exist a uniformly best unbiassed estimator 
even in the simplest case of s = 2. If, however, there exists a complete sufficient 
statistic [3] for @ and if w is convex (downwards) for every fixed @ ¢ 2, then there 
exists an essentially unique uniformly best unbiassed estimator for every 
estimable parametric function ys . The convexity of the loss function is essential 
in the proofs of the above results. We demonstrate in the next section how a 
slight departure from the convexity of the loss function might destroy all these 
results. 


3. Nonexistence of a best unbiassed estimator. Let us assume that w(t, @) = 
w(use, 8) = O for all ¢ and @. That is, we assume that the loss function is non- 
negative and that there is no loss when the estimate hits the mark. Let U be 
the class of all unbiassed estimators 7, of us for which the risk function 


O|n) = Ew, 0) |, 6 = [ dPs [ w(t, 0) dn 


is defined for all 6. We prove the following 
THEOREM. If for every fixed @ € Q the loss function w(t, 0) is bounded in every 
finite interval \t — use| S A, and is of |t — ) as |t — wel — ©, then 
inf r(@ | ”) = 
7eU 
Proor. Let 7 = T, be a nonrandomized unbiassed estimator of ue. The 
existence of an unbiassed estimator clearly implies the existence of such a T,. 
Consider now the randomized estimator 7‘ = n°” which, for any z ¢ X, has its 
entire probability mass concentrated at the two points we, and (Tz; — ws,)/é 
+ jue, on the real line in the ratio 1 — 6 to 6, with 0 < 6 < 1. It is easily 
verified that 7 is an unbiassed estimator of us and that 


r(o | n) = Elw(t, 6) | n, 00] 
= Eléw(H/é + us,, 9%) | %l, H=T.z— way. 


Since w = o( |t — wo,| ) as |t — we,| > ©, given e > 0 we can determine A so 
large that 


w(t, ) Se \t so Heo |, l¢ =a Hey | = 
Let B= sup w(t, %) < o. Then 


|t—Hagi<A 


r(6 | n°”) -{f =m + us,, 9%) dPs, 
aa pe wi 
2 


Since « and 6 are arbitrary and B depends only on e, it follows that 
inf,-v 7(80 | ») = O. Since 4 is arbitrary, the theorem is proved. 
Now, if w(t, 6%) > 0 for t * ws, , then r(@ | ») can be zero only if nz gives 
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probability one to ye, for almost all z with respect to the measure P,, . In the 
usual circumstances, n, then would not be an unbiassed estimator of ps . 

Thus, this theorem shows that if we work with a loss function satisfying the 
conditions of the theorem, even locally best unbiassed estimators would not 
exist in all the familiar situations in which we are interested. In particular 
estimation problems, it will be easy to see that the theorem holds even in the 
restricted class U* of all nonrandomized estimators of u. In the next section we 
consider the classical problem of estimating the mean of a normal population, 
but with a slightly altered definition of variance. 


4. The case of the normal mean. Let x = (2, t2,-+--, %,) be a random 
sample from N(6, 1). The problem is to get a good unbiassed estimator of @ with 
the loss function 


(t — 6)’, t@-— el <a 
a’? |¢ — o|"?, jt — 6| >a, 


? 


w(t, 0) = 


where a is an arbitrarily large constant. 
Let Z and s” be the sample mean and variance, respectively, and let cs be the 


upper 1006 per cent point of the probability distribution of s*, where 0 > 6 > 1. 
Consider the nonrandomized estimator 


T® = TO ft . 


* <6, 
(Z — O)/6 + %, s>o. 


Since the distribution of s° is independent of @ and 2, it follows that T® is a 
function of z and 6 alone and that 7, for every fixed 6 with 0 < 6 < 1, is 
an unbiassed estimator of 6. Also 


r() | T°”) = Eléw{(z — 6) / 5 + O, 6} | Ol 


=i-— nlt = a|* 
J... 3( ; sa” |——_—_|__ (2) dz 
—89| sa 


[2-89 |>ad 6 
< ba” + 5°? a®” E(\z — Oo|"” | 6), 


where ¢(Z) is the frequency function of when 6 = 6). Thus r(4% | T)—0 
as 6 — 0. Therefore 
inf r(o| T) = 0, —2x <A< mw, 
Tevu* 
where U* is the class of all nonrandomized unbiassed estimators of 6. 

When the constant a is very large, the modification to the usual definition of 
variance apparently is very negligible, yet this slight change of variance com- 
pletely wrecks the theory of unbiassed estimation. Not even locally best un- 
biassed estimators exist, let alone a uniformly best one. 

In the construction of 7”, the independence of s* and Z is not essential. As a 
matter of fact, we can replace s* by any real-valued statistic Y whose conditional 
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distribution, given , is continuous. We then replace cs by c;(), where c;(2) is, 
say, the upper 1006 per cent point of the conditional distribution of Y given Z. 
From the sufficiency of Z it follows that c;(Z) is independent of 6, and the rest 
of the proof follows through. Under similar circumstances the general theorem 
proved earlier will remain true in the restricted class U* of all nonrandomized 
unbiassed estimators of ps . 
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ON CONFIDENCE INTERVALS OF GIVEN LENGTH FOR THE MEAN OF 
A NORMAL DISTRIBUTION WITH UNKNOWN VARIANCE 


By Lionet WeEtss! 


University of Virginia 


1. Summary. The problem of finding a confidence interval of preassigned 


length and of more than a given confidence coefficient for the unknown mean of 
a normal distribution with unknown variance is insoluble if the sample size used 
is fixed before sampling starts. In this paper two-sample plans, with the size of 
the second sample depending upon the observations in the first sample (as in 
{1]), are discussed. Consideration is limited to those schemes which increase the 
center of the final confidence interval by k if each observation is increased by k, 
and for which the size of the second sample is a function only of the differences 
among the observations in the first sample. Then it is shown that the mean of 
all the observations taken should be used as the center of the final confidence 
interval. Those schemes which make the size of the second sample a nondecreas- 
ing function of the sample variance of the first sample are shown to have certain 
desirable properties with respect to the distribution of the number of observa- 
tions required to come to a decision. 


2. Assumptions. We deal with an infinite sequence of independent and 
identically distributed chance variables, (X;, X:, ---). Each has a normal dis- 
tribution with unknown mean y and unknown standard deviation ¢. A positive 
integer n = 2 is given, and X,, --- , X, are observed. Then additional chance 
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variables are observed to the number of N(X.— Xi1, X; — Xi,---,Xn— X:). 
This last expression is a measurable function of X. — X,, --- ,X, — X;, which 
we shall often abbreviate to N. 

A fixed positive number A and a fixed 8, with 0 < 8 < 1, are given. We are 
going to study schemes that yield a confidence interval for » of length A, and 
of confidence coefficient greater than 8, regardless of the value of «. We limit 
consideration to schemes such that after the second sample has been taken, 
the center of the final confidence interval is a function C(X,,--- , Xn, Xnai, 

- , Xa+w) such that 


C(X, + k, cs Xn+n + k) = C(X,, aa ee Xn+w) + k, 


identically in k and in those values of X,, --- , Xn4w for which we take a total 
of n + N observations, for all values of N. 


3. The optimum center of the final confidence interval. Once a function 
N(X2 — X1,-°--, Xn — X:) has been assigned, then the distribution of the 
total sample size is determined. This distribution does not depend on uz for the 
class of schemes under consideration. Once we have chosen the function N, there 
is still the problem of assigning the center of the confidence interval, which will 
affect the confidence coefficient. (The confidence coefficient is a function of ¢ 
and of the function N, but not of u for the cases we are discussing.) We have 

Lemma 3.1. For any given function N(X2 — X1,--:, Xn — Xi), setting the 
center of the confidence interval at the mean of all the n + N observations taken 
maximizes the confidence coefficient uniformly in oc. 

Proor. The joint conditional density of X,,---, Xn, Xnii,+*+, Xnim, 
given that N(X, — Xi1,-°--, Xa, — X:) = m, is equal to 


H(a) exp {—Dot*™ (2; — u)*/20°} 


in the region where N = m, and zero elsewhere. The exact form of H(c) is of 
no concern to us. Denoting by X the mean of the values X,,--- , Xnim, We 
can write C(X,,--: , Xnim) as X + G(X, — X,--- , Xnzm — X), where the 
function G determines and is determined by the function C. A simple calculation 
shows that choosing G(X, — X,---, Xnim — X) identically equal to zero 
maximizes 


Pi(X + G@ — 44) <u < (X + G+ 44) | (N = m)} 


with respect to the function G for any value of ¢, thus proving the lemma. 

This calculation also shows that once we set X as the center of our confidence 
interval, the conditional probability, given that N = m, of having our confi- 
dence interval contain » when ¢ is the standard deviation is equal to 


+s Vet mp 


exp {—(n + m)/20°} dt, 


o 4/2 


and does not depend on which function N(X, — X,,---, X, — X:) is used. 
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We shall denote the quantity given in (1) by 8(¢, m). We note that 8(c, m) is 
increasing in m for any fixed o, and decreasing in o for any fixed m. For fixed 
m, 8(¢, m) approaches unity as o approaches zero. 


4. Properties of a certain class of functions N(X, — Xi, --- ,X, — Xi). 
We shall denote >> (X; — X)* by S(X), with X standing for the generic point 
in (X,,-°--, X,) space. The symbol P(R | c) stands for the probability of the 
region R when the standard deviation has the value c. We have 

Lemma 4.1. Given any function N(X2 — Xi, +--+ , Xn — Xi), any non-negative 
integer m, and any positive value o’, there exists a non-negative number d(o’, m) 
such that 


P[S(X) < d(o’, m)|o] = PIN(X: — Xi,---, Xn — X1) S m|o] 
according to whether o = o’. 
Proor. The two sets of values (X2 — Xi, --- , Xn — X:) and (X, — X, -:- 


? 


X, — X) determine each other uniquely. The joint density of the second set is 


p(X2 — X,---, X, — X) = (K/o”™") exp{ —S(X)/20’}, K constant. 


From this, it follows easily that if a. < o; < a2, then of all regions R in (X; 
— X,---,X, — X) space with P(R | 01) equal to a specified value, the region 
of the form S(X) < d, where d isa constant, minimizes P(R | c2) and maximizes 
P(R | oo). This proves the lemma. 


5. Optimum properties of a certain class of procedures for setting the size of 
the second sample. For any sample-size rule R, g(m, c, R) denotes the probability 
that the size of the second sample will be m when the standard deviation is ¢ 
and R is used, while G(m, o, R) denotes the probability that the size of the sec- 
ond sample will not exceed m under the same circumstances. Let 


B(c, R) = Yimmo g(m, ¢, R) B(s,m), ba, m, R) = Yio g(t, o, R) B(a, %). 


Thus B(c, R) is the confidence coefficient when R is used and a is the standard 
deviation. 

We shall say that a rule 7 setting the size of the second sample is of type S 
if it assigns size m to precisely those points X = (X.,---, X.) for which 


ana(T) < S(X) S a,(T), m= 0,1,2,---, 


where a_;(7'), ao(7’), --- are preassigned numbers such that 0 = a_,(T) < 
a(T) S a,(T) S --- . It is clear that for any rule T of type S, if for some posi- 
tive ¢ we have G(m, oc, T) approaching unity as m increases, then this happens 
for all c. 

Lemna 5.1. If T is a sample-size rule of type S such that for any finite o there is a 
finite positive integer s(¢, T) such that b{c, s(¢, T'), T| > 8, then for any o’ there is a 
finite positive integer N(o’, T) such that blo, N(o’, T), T| > 8B for alle S a’. 

Proor. From continuity considerations, it is easily seen that for any o in the 
interval (0, o’], if b[c, s(¢, T), T] > 8 then cg is in the interior of a nondegenerate 
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interval such that for any o” in the interval, b[o”, s(¢, 7), T'| > 8. By the Heine- 
Borel theorem, a finite number of such intervals can be found to cover the inter- 
val [6, o’] for any positive 6. This proves the lemma, since a neighborhood of 
zero causes no trouble. 

THEOREM 5.1. Given any sample-size rule R such that G(m, o, R) approaches 
unity as m increases for any value of ¢, and B(c, R) > B for all c, there is a rule 
R’ of type S, with B(c, R’) = B for all o, such that for any oa there is a nonnegative 
finite integer M(c) so that 


G(m, o, R’) = G(m, a, R), for allm > M(c). 


Proor. For the sake of definiteness, if 7 is any rule of type S, we shall under- 
stand that N(o’, 7) is the smallest positive integer such that b[o, N{c’, 7, T| > 
Bforalle S oa’. 

Let oo be the value of o which is the solution to B(¢, 0) = 8. Let 6 denote a 
fixed positive number, and o; denote oo + 75, for i = 1, 2, --- . Let Ro be the 
uniquely determined rule of type S such that g(m, oo, Ro) = g(m, oo, R) for 
all m. From the definition of op , for any o S ap there is a finite integer s(c, Ro) 
defined in Lemma 5.1. 

Also, by Lemma 4.1, G(m, o, Ro) S G(m, o, R) for all « > oo and for all m. 
This fact and the assumptions about R imply that for each ¢ > op there is a 
finite s(¢, R). We define N; as the larger of 1 or N(o;, Ro). By Lemma 5.1, N, 
is finite. 

Next we define another rule R, of type S, as follows. Take a;(R,) = a,(Ro) for 
a= —1,---,Ni +1, while a,(R,) fort > Ny + 1 is chosen so that G(i,o,, Ri) = 
G(i, o:1 , R). This is possible, since G(N; + 2, o,, R) 2 G(Ni + 2, o1, Ro). It is 
clear that for any o S o; there is a finite s(¢, R,). Also, by Lemma 4.1, G(m, co, 
R,) S G(m, o, R) for any ¢ > o, . This implies that for any o > o;, there is a 
finite s(o, Ry). Now we can define the finite number N; as the larger of N; + 1 or 
N(a2 ’ R,). 

Next we define a rule R, of type S, as follows. Take a;(R.) = a;(R,) fori = 
—1,0,--- ,N2+ 1, while a;(R,) fori > Nz + 1 is chosen so that G(i, a2, Re) = 
G(i, oz, R). By the same reasoning as above, we see that this is possible, and 
that we can define a finite N(c; , R.). Then we define N; as the larger of N2 + 1 
or N(e3 ; R.). 

In general, once having defined the rule R; of type S, we define a rule Rj4,; 
of type S, as follows. Take a;(Rj41) = a(R,;) fori = —1, --- , Nj41 + 1, while 
a;(Rj41) for i > Njis + 1 is chosen so that G(7, oj41, Rjas) = Gi, ojar, R). 
Then we define N ;,2 as the larger of N js; + 1 or N(oj;42, Rj41), and proceed as 
above. 

Clearly, for any fixed 7, lim;,.. a;(R;) exists. As a matter of fact, above a cer- 
tain j, a;(R;) remains constant. Finally, we define a rule R’ of type S by a;(R’) 
= lim;... ai(R;). Foreachj, B(o, R;) > 8 for any o. Therefore, it is clear from 
continuity considerations that for anyc, B(c, R’) = 8. Also, for any given o’, 
there is ak so that o’ < o. 
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Let m be any integer greater than N; + 1. Then there is a nonnegative in- 
teger j so that Nis; + 1 << m S Nuijai + 1. Then for all o, G(m, o, Resj4:) 
= G(m, oc, R’). Also, G(m, ons; , Rex 3) = G(m, ox4;, R). Therefore by Lemma 
4.1, G(m, o’, Rex;) 2 Gm, o’, R). Now by construction, G(m, ¢, Re+j+1) 

G(m, o, Re+;), for all oc. From these relations, we deduce that G(m, o’, R’) 
G(m, o’, R).This proves the theorem, for M(e’) can be taken as N, + 1. 


= 
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NEWS AND NOTICES 


Readers are invited to submit to the Secretary of the Institute news items of interest 
Personal Items 


Max Astrachan was on leave of absence from his position as Professor of Sta- 
tistics, USAF Institute of Technology, during July and August as Operations 
Analyst with Hq 49th Air Division in England. 

Maurice H. Belz has been appointed to the recently created Chair of Statistics 
in the University of Melbourne. 

Paul C. Clifford has been granted a leave of absence from Montclair State 
Teachers College to serve as Consultant on Quality Control for the Organization 
for European Economic Cooperation. This project, which is sponsored by the 
Foreign Operations Administration, will take Mr. Clifford to Norway, Sweden, 
Denmark, the Netherlands, Germany, France, Italy, and Turkey. 

Mr. Gerald D. Berndt, formerly with the Quartermaster Research and De- 
velopment Center at Natick, Mass., has accepted a special assignment with the 
Advisory Committee on Weather Control in Washington, D. C. The assignment 
is for 18 months and is concerned with analyzing meteorological experiments. 

Dr. H. A. David has resigned from his position with the Commonwealth Scientific 
and Industrial Research Organization of Australia to take up a Senior Lecture- 
ship in Statistics in the University of Melbourne. 

After receiving his Ph.D. from the Department of Mathematical Statistics of 
Columbia University, Cyrus Derman has joined the faculty of Syracuse Uni- 
versity as an Instructor in the Department of Mathematics. 

Frank M. duMas has joined the staff of the Department of Psychology, 
Montana State University. He formerly was a member of the faculty of Michigan 
State College. 

John W. Forman, formerly an instructor in Mathematics at the University 
of Kansas, is now an Applied Science Representative for International Business 
Machines Corporation in Kansas City, Missouri. 

Mr. Edwin L. Godfrey has accepted a position as Supervisory Mathematician 
in the Problem Formulation Section of the Statistical Services Division associated 
with the UNIVAC electronic computer at Wright-Patterson Air Force Base. 
He has just returned from New York where he was enrolled in a course in Pro- 
gramming conducted by the training division of Remington Rand, Inc. 

Roe Goodman has been transferred from the Technical Assistance Administra- 
tion of the United Nations to the Food and Agriculture Organization under 
whose program of technical assistance he has been sent to Chile in connection 
with the Chilean Agricultural Census, 1955. He and his family arrived in Santiago 
in September, 1954 for an assignment of one year. 

Dr. T. N. E. Greville formerly a statistician with the United States Opera- 
tions Mission to Brazil has been appointed Assistant Chief Actuary of the Social 
Security Administration in Washington, D. C. 
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Robert Hooke, of the Analytical Research Group, Princeton University, has 
accepted a position, starting in July, as a Research Mathematician with the 
Westinghouse Research Laboratories, East Pittsburgh, Pa. 

Arthur Horberg, formerly of the Institute for Air Weapons Research, has 
accepted the position of Mathematician with the Bendix Research Laboratories, 
Detroit. 

W. Robert Hydeman has been transferred by Remington Rand, Inc. to the 
Electronic Computer Department at 315 Fourth Ave., New York City. Mr. 
Hydeman’s activities as Sales Engineer include the application of electronic 
computers to engineering and scientific calculations and the analysis of data 
handling systems employing electronic computers as their central processing 
facilities. 

R. J. Jessen was on leave from the Statistical Laboratory, Iowa State College, 
for six weeks (from mid-February through March 1955) to serve as Technical 
Expert for FAO at a Latin American Demonstration Center on Agricultural 
Sampling Techniques being held in Buenos Aires, Argentina. 

B. S. Kawar is now at the University of Illinois on a half-time teaching as- 
sistantship while working on his Ph.D. in Statistics. 

Edward H. Kingsley is Operations Analyst in the Office of Operations Analysis, 
Deputy Chief of Staff/Operations, HQ, Air Proving Ground Command, Elgin 
Air Force Base, Florida. 

Miss Evelyn L. Rumer has recently married and is now Mrs. Edward C. 
Kramer. 

Upon completion of his Ph.D. in Statistics at Stanford University, Paul Meyer 
was drafted into the army. He is now stationed in Baltimore, at HQ-Materiel 
Command of the Chemical Corps, and is working as a statistician in the Quality 
Evaluation Division. 

M. Ray Mickey has resigned from his position as Assistant Professor in the 
Statistical Laboratory of Iowa State College to rejoin Rand Corporation, Santa 
Monica, California, in April 1955. He will be part of a group working on logistic 
problems for the U. 8. Department of Defense. 

O. B. Moan has accepted a position as Research Specialist in the Staff Engi- 
neering Department of Lockheed Aircraft Corp., Missile Systems Division, 
Van Nuys, California. 

Bruce D. Mudgett is doing a year of teaching at Brown University as Visiting 
Professor of Economics, filling in for a regular member of their staff who is spend- 
ing a year with the National Bureau of Economic Research in New York. 

Henry H. Peterson has accepted a position as Associate Engineer, Operations 
Analysis for the Douglas Aircraft Company at Long Beach, California. 

Dr. Bayard Rankin has accepted a position as Research Associate in the 
Department of Mathematics, Massachusetts Institute of Technology. 

Joseph S. Rhodes has resigned his position as Mathematical Statistician, De- 
partment of the Air Force, Washington, D. C. to accept a position as Methods 
and Operations Research Officer with the Chesapeake and Ohio Railway Com- 
pany, Cleveland, Ohio. 
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Dr. W. A. Shewhart has been given the first Honorary Professorship in Statisti- 
cal Quality Control by Rutgers University. He is at present in India and was this 
year elected to Honorary Membership in the Royal Statistical Society and 
awarded the Holley Medal of the American Society of Mechanical Engineers. 

Rosedith Sitgreaves has accepted a position as Research Associate on Air Force 
Project on “Discriminatory Analysis’’ at Teachers College, Columbia University, 
New York. 

Dr. Milton Sobel has accepted a position, starting Sept. 1, 1954, with the 
Bell Telephone Laboratories in Allentown, Pa. He was formerly working on an 
Air Force Research Contract at Cornell University. 

Bergen R. Suydam has accepted the position of Chief, Operations Analysis, at 
Headquarters, United States Air Forces in Europe. 

Kazuo Uemura has accepted the appointment by the World Health Organiza- 
tion of the United Nations to the post of Statistician in Geneva, Switzerland. 

John Richard Wittenborn has been appointed at Rutgers University as Uni- 
versity Professor of Psychology with assignment to the School of Education as 
Chairman of the Department of Pupil Personnel Service. This Department in- 
cludes measurements and statistics, guidance, and school psychology. 

In February Dr. E. 8. Watson assumed the position of Senior Fellow in the 
Department of Statistics, Research School of the Social Sciences, Australian 
National University, Canberra, Australian Central Territory. The Australian 
National University, founded about four years ago is entirely a postgraduate 
university. 

Martin B. Wilk, associate in the Statistical Laboratory and the Iowa Agri- 
cultural Experiment Station, has been awarded the George W. Snedecor Award 
in Statistics for 1955. The award, comprising a year’s membership in the IMS 
and a subscription to its Annals, is given annually to the most outstanding 
candidate for the Ph.D. degree in statistics at Iowa State College. 

Dr. Sewall Wright has retired from the University of Chicago (Department of 
Zoology) and has accepted the Leon J. Cole Professorship of Genetics at the 
University of Wisconsin. 


a 


Summer Offerings in Statistics at Iowa State College 


The Department of Statistics at Iowa State College will offer a course in de- 
cision theory at the advanced graduate level during the first half of the 1955 
summer quarter. The course will be taught by Dr. 8. L. Isaacson. Members of 
the graduate faculty in statistics will be available during most of the summer for 
consultation on graduate research and for special problems courses. Other offer- 
ings for the two six-week sessions (June 13-July 20 and July 20—August 26) of 
the summer quarter are designed mainly for the graduate minor in statistics and 
for the beginning graduate major in statistics who wishes to satisfy prerequisite 
requirements for more advanced courses. These additional offerings include 
“Statistical Methods for Research Workers”, “Statistical Theory for Research 
Workers”’, ““Experimental Designs for Research Workers’’, and “Survey Designs 
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for Research Workers”. For additional information, write to: T. A. Bancroft, 
Director, The Statistical Laboratory, lowa State College, Ames, Iowa. 


(ee 


Summer Offerings in Statistics at the University of Michigan 


During the Summer Session of 1955, June 20-August 12, the University of 
Michigan in Ann Arbor will offer a variety of programs and courses of interest 
to statisticians and students of mathematical statistics. The Department of 
Mathematics will present a number of elementary and advanced courses and 
seminars on theoretical statistics, probability, finite differences, and game theory. 
These courses will be under the direction of Professors A. H. Copeland, Sr., 
C. C. Craig, P. 8S. Dwyer, C. J. Nesbitt, and Dr. Clarke. A number of courses in 
applied statistics and statistical methods will also be given under the auspices 
of the Departments of Mathematics, Economics, Sociology, Psychology, and the 
School of Business Administration. 

In addition to the preceding courses, a number of special programs in allied 
fields will be offered. Among these are the Program in Survey Research Tech- 
niques, July 18-August 12, the Summer Institute in Mathematics for Social 
Scientists, June 20-August 12, the Program on Digital Computers and Data 
Processors, August 1-12, and the Program on Quality Control, August 17-27. 

Inquiries regarding courses in Statistics should be directed to Professor C. C. 
Craig, Department of Mathematics, and inquiries regarding the allied programs 
should be addressed to the Office of the Summer Session, 3510 Administration 
Building, Ann Arbor, Michigan. 
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Doctoral Dissertations in Statistics, 1954 


Listed below are the doctorates conferred during the year 1954 in the United 
States and Canada for which the dissertations were written on topics in statistics 
or related fields. The university, major subject, and the title of the dissertation 
are given in each case. 

S. G. Allen, Stanford, major in statistics, “A Class of Minimax Tests for One- 
sided Composite Hypotheses’’. 

S. L. Andersen, North Carolina State College, major in experimental statistics, 
“Robust Tests for Variances and Effect of Non Normality and Variance Hetero- 
geneity on Standard Tests’’. 

G. E. Baxter, Minnesota, “‘An Application of Stochastic Processes to Certain 
Problems in the Brownian Motion of Continuous Media”. 

D. R. Bey, Illinois, major in statistics, “Modified Discriminant Analysis of 
School Organizations’’. 

Allan Birnbaum, Columbia, major in mathematical statistics, ‘“Characteriza- 
tions of Complete Classes of Tests of Some Multiparametric Hypotheses with 
Applications to Likelihood Ratio Tests’. 
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Isadore Blumen, North Carolina, major in statistics, ‘“The Estimation of the 
Means of the Multivariate Normal Distribution: Minimax Solutions’’. 

R. N. Bradt, Stanford, major in statistics, “On the Design and Comparison 
of Certain Dichotomous Experiments”’. 

Barron Brainerd, Michigan, major in mathematics, ““An Algebraic Theory of 
Probability with Applications to Analysis and Mathematical Logic’”’. 

Leo Breiman, California (Berkeley), major in mathematics, ‘““Homogeneous 
Processes’’. 

P. P. Crump, North Carolina State College, Major in experimental statistics, 
“Optimal Designs to Estimate the Parameters of a Variance Component Model’’. 

M. B. Danford, North Carolina State College, major in experimental statistics, 
“Factor Analysis and Related Statistical Techniques’. 

Cyrus Derman, Columbia, major in mathematical statistics, “Some Contribu- 
tions to the Theory of Markov Chains’’. 

A. R. Eckler, Princeton, major in mathematics, ‘Rotation Sampling’. 

S. M. Free, Jr., North Carolina State College, major in experimental statistics, 
“Relationship of Instrument Color Measurements to Some Quality Indices of 
Flue-cured Tobacco”’. 

W. R. Gaffey, California (Berkeley), major in mathematical statistics, ‘“The 
Problem of Within-family Contagion”’. 

R. K. Getoor, Michigan, major in mathematics, “Some Connections Between 
Operators in Hilbert Space and Random Functions of the Second Order’’. 

A. H. E. Grandage, North Carolina State College, major in experimental 
statistics, ‘““Biological Assay of a Material when Interfering Substances are 
Present’’. | 

L. E. Grosh, Purdue, major in mathematics, ‘‘Uniform Distribution and 
Round-off Error’. 

J. E. Hafstrom, Minnesota, major in mathematics, “Non-Linear Transforma- 
tions in Wiener Space of the form, y(t) = z(t) + gt; x(t), ---, x(tn)]”. 

D. C. Hastings, Minnesota, major in statistics, “A Monthly Index of Regional 
Business Activity Based on Net Product Measurements”’. 

J. 8. Hunter, North Carolina State College, major in experimental! statistics, 
“‘Multi-factor Experimental Designs’. 

P. K. Ito, St. Louis, “On the Simultaneous Minimax Point Estimation”’. 

E. M. Jacobs, Iowa State College, major in statistics, “A Statistical Tech- 
nique for Estimating the Characteristics of Consumer Behavior’. 

Maurice Kennedy, California Institute of Technology, “Ergodic Theorems 
for a Certain Class of Markoff Processes’’. 

H. 8. Konijn, California (Berkeley), major in mathematical statistics, ‘‘On 
the Power of Some Tests for Independence’’. 

C. H. Kraft, California (Berkeley), major in mathematical statistics, ‘“On 
the Problem of Consistent and Uniformly Consistent Statistical Procedures’. 

R. B. McHugh, Minnesota, major in statistics, ““On the Scaling of Psychologi- 
cal Data by Latent Structure Analysis’. 
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P. L. Meyer, Stanford, major in statistics, ““An Application of the Invariance 
Principle to the Student Hypothesis”. 

T. V. Narayana, North Carolina, major in statistics, “Sequential Procedures 
in Probit Analysis’’. 

K. C. S. Pillai, North Carolina, major in statistics, ‘(On Some Distribution 
Problems in Multivariate Analysis’. 

J. H. Powell, Michigan State College, major in mathematical Statistics, “A 
Mathematical Model for Single Function Group Organization Theory with Ap- 
plications to Sociometric Investigations’. 

Bayard Rankin, California (Berkeley), major in mathematical statistics, “The 
Concept of Sets Enchained by a Stochastic Process and its Use in Cascade 
Shower Theory”’. 

W. C. Ross, Jr., State Univ. of Iowa, major in mathematics, ‘“‘Certain Func- 
tions of Order Statistics’’. 

Jacques Saint Pierre, North Carolina, major in statistics, ‘Distribution of 
Linear Contrasts of Order Statistics’. 

Herbert Scarf, Princeton, “Differential Operators on Manifolds, and Applica- 
tions to Stochastic Processes’. 

R. F. Schweiker, Harvard, ‘Individual Space Models of Certain Statistics’’. 

K. C. Seal, North Carolina, major in statistics, “On a Class of Decision Pro- 
cedures for Ranking Means’’. 

J. M. Shapiro, Minnesota, major in mathematics, ‘“An Error Estimate for 
the Convergence of Distributions of Sums of Independent Random Variables 
to Infinitely Divisible Distributions’’. 

Morris Skibinsky, North Carolina, major in statistics, “Some Properties of 
a Bayes Two-stage Test for the Mean”’. 

Harry Smith, Jr., North Carolina State College, major in experimental statis- 
tics, “Weighting Coefficients for Age-adjusted Death Rates’. 

D. §. Stoller, California (Los Angeles), major in statistics, “Asymptotically 
Optimum Discriminant Function Estimates’’. 

H. L. Stubbs, Boston, major in mathematics, ‘“Non-normal Models for the 
Classification of Speech Sounds’’. 

W. A. Thompson, North Carolina, major in statistics, “On the Ratio of Vari- 
ances in the Mixed Incomplete Block Model’”’. 

A. W. Wortham, Oklahoma A and M, “On Estimates of Variance Com- 
ponents”. 

Z. 8. Wurtele, Columbia, major in mathematical statistics, “Some Properties 
of Bayes Procedures which Improve Lot Quality”. 

— ee 
New Members 
The following persons have been elected to membership in the Institute 


November 17, 1954 to February 8, 1955 


Alder, Henry L., Ph.D. (Univ. of California), Assistant Professor of Mathematics, Uni- 
versity of California, Davis, California. 
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Ansari, Abdur Rahman, M.A. (Muslim Univ., Aligarh, India), Graduate Assistant, Math- 
ematics Department, University of Oregon, Eugene, Oregon. 

Bell, Earl Louis, M.S. (Northwestern Univ.), Mathematical Statistician, Department of 
the Air Force, School of Aviation of Medicine, Department of Biometrics, Randolph 
Air Force Base, Texas, 483 Trudel Drive, San Antonio, Texas. 

Birdsall, T. G., M.S. (Univ. of Michigan), Research Associate, Engineering Research In- 
stitute, University of Michigan, Ann Arbor, Michigan. 

Blasbalg, Herman, M.S.E.E. (Univ. of Maryland), Research Associate, Johns Hopkins 
University, 1315 St. Paul Street, Baltimore, Maryland. 

Brown, William Fuller, Jr., Ph.D. (Columbia Univ.), Research Physicist, Sun Oil Company, 
Sun Physical Laboratory, Newtown Square, Pennsylvania, 600 Parrish Road, Swarth- 
more, Pennsylvania. 

Burros, Raymond H., Ph.D. (Yale Univ.), Visiting Associate Professor of Psychology, 
University of Houston, Houston, Texas. 

Carroll, Mavis B. (Mrs.), B.Sc. (Rutgers Univ.), Project Leader—Statistician, Central 
Research Laboratories, General Foods Corporation, 11th and Hudson Sts., Hoboken, 
New Jersey. 

Doyle, Patrick J., B.S. (Univ. of Washington), Mathematical Statistician, Naval Ordnance 
Laboratory, Corona, California. 

Durbin, James, M.A. (Cambridge Univ.), Reader in Statistics, University of London, 
London School of Economics and Political Science, Houghton Street, London W.C.2, 
England. 

Elias, Peter, Ph.D. (Harvard Univ.), Assistant Professor of Electrical Engineering, Mas- 
sachusetts Institute of Technology, Cambridge 39, Massachusetts. 

Garratt, Alfred E., M.S. (Northwestern Univ.), Chief, Design and Analysis Office, BW 
Assessment Laboratories, Dugway Proving Ground, Dugway, Utah, 145 North 2nd 
Street, Tooele, Utah. 

Golub, Gene H., M.A. (Univ. of Illinois), Research Assistant, Electronic Digital Computer, 
162 Engineering Research Laboratory, University of Illinois, Urbana, Illinois. 

Graves, Clayborn L., M.S. (Univ. of Southern Calif.), Supervisory Mathematical Statisti- 
cian, Naval Ordnance Laboratory, Corona, California, 18882 E. Crescent Drive, Ana- 
heim, California. 

Gregory, Geoffrey, B.A. (Cambridge Univ.), Research Assistant and Student, Department 
of Applied Mathematics and Statistics, Stanford University, Stanford, California. 
Guha Thakurta, B. K., M.S. (Univ. of Calcutta), Assistant Superintendent of Statistics, 
State Statistical Bureau, Government of West Bengal, P.O.—Krishnagar, Dist.—Nadia, 

West Bengal, India. 

Hall, W. B., B.A. (Univ. of Melbourne), Research Officer, Commonwealth Scientific and 
Industrial Research Organization, Division of Mathematical Statistics, Graham Road, 
Highett, $21, Victoria, Australia, 188 Whitehorse Road, Balwyn E.8., Victoria, Aus- 
tralia. 

Helmstadter, G. C., Ph.D. (Univ. of Minnesota), Associate in Research, Educational 
Testing Service, 20 Nassau Street, Princeton, New Jersey. 

Homma, Tsuruchiyo, M.S. (Téhoku Univ.), Lecturer of Mathematics, Mathematical In- 
stitute, Faculty of Science, Tokyo Metropolitan University, 591 Fusuma-cho, Meguro- 
ku, Tokyo, Japan. 

Johnson, Millicent H., B.A. (Univ. of Minnesota), Teaching Assistant, Biostatistics 
Division, Department of Public Health, University of Minnesota, Minneapolis 14, 
Minnesota. 

Katz, Morris W., B.A. (McMaster), Graduate Student in Statistics, University of Llinois, 
Urbana, Illinois, 92 Hillcrest Avenue, Hamilton, Ontario, Canada. 

Kennedy, George H., A.B. (Princeton Univ.), Analytical Statistician (Biological Sciences), 
U.S. Army Chemical Corps, Camp Detrick, Frederick, Maryland. 
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Lentz, Gordon A., B.A. (Univ. of Minnesota), Graduate Student, University of Minnesota, 
Minneapolis 14, Minnesota, 7508 3rd Avenue South, Minneapolis, Minnesota. 

Mardessich, Bartolo, Doctor in Statistics (Univ. of Rome), Assistant Director, DOXA— 
Institute for Statistical Research and Public Opinion Analysis, Via Mameli, 10— 
Milano, Italy, Viale Lucania, 21—Milano, Italy. 

Miller, Irwin, M.S. (Purdue Univ.), Graduate Research Assistant. Department of Statistics, 
Virginia Polytechnic Institute, Blacksburg, Virginia. 

Neff, M. Carolyn, B.S. (Western Reserve Univ.), Administrative Assistant, Development 
Division, Lecturer, Secretarial Sciences, School of Business, Graduate Student, School 
of Business, Western Reserve University, 2040 Adelbert Road, Cleveland 6, Ohio. 

Oakford, Robert Vernon, B.A. (Stanford Univ.), Graduate Student in Statistics, Stanford 
University, Stanford, California, 10 Maywood Lane, Menlo Park, California. 

Peries, H, E., M.A. (Cantab), Director of Census and Statistics, Department of Census 
and Statistics, Government of Ceylon, P. O. Box 563, Colombo, Ceylon, 16, DeFonseka 
Road, Colombo 5, Ceylon. 

Pinkham, Roger S., A.M. (Harvard Univ.), Graduate Student, Harvard University, Cam- 
bridge, Massachusetts, 203C Holden Green, Cambridge, Massachuseits. 

Polowy, Henry, M.S. (New York Univ.), Instructor of Mathematics, Stevens Institute of 
Technology, Hoboken, New Jersey. 

Robinson, William Keith, B.S. (Univ. of Washington), Actuary, Old American Life Com- 
pany, 1619 Terry Avenue, Seattle 1, Washington. 

Schlaifer, Robert, Ph.D. (Harvard Univ.), Associate Professor of Business Administration, 
Graduate School of Business Administration, Harvard University, Boston 63, Mas- 
sachusetts. 

Schneider, Robert C., B.S. (Seattle Univ.), Mathematical Statistician, Naval Ordnance 
Laboratory, Corona, California. 

Shellow, James M., B.A. (Univ. of Chicago), Director of Research, Henry Shellow and 
Associates, 1012 North Third Street, Milwaukee, Wisconsin. 

Shugert, Richard F., B.S. (Rutgers Univ.), Engineer Aerodynamicist, Applied Aeronautical 
Research, Stroukoff Aircraft Corporation, West Trenton, New Jersey. 

Silvey, S. D., Ph.D. (Glasgow Univ.), Lecturer in Mathematics, University of Glasgow, 
Glasgow W.2., Scotland. 

Star, Martin L., B.B.A. (City College of New York), Quality Control Supervisor, Sonotone 
Corporation, Cold Spring, New York, 2270 81st Street, Brooklyn 14, New York. 

Stiver, Ray B., Jr., M.B.A. (Univ. of Buffalo), Dynamics Engineer, Bell Aircraft Corpora- 
tion, Wheatfield Plant, P.O. Box 1, Buffalo 5, New York, 914 Michigan Avenue, Buffalo 
3, New York. 

Troll, John H., M.A. (Harvard Univ.), Vice President for Engineering, Electronics Cor- 
poration of America, 77 Broadway, Cambridge 42, Massachusetts. 

Udinski, W. P., Ph.D. (Univ. of Illinois), Mathematician, Human Resources Research 
and Personal Training Section, Machine Processing Branch, Lackland Air Force 
Base, San Antonio, Texas, 714 South Palmetto Avenue, San Antonio 3, Texas. 

Yarnold, James K., B.S. (Trinity Univ.), Research Assistant, Statistical Laboratory, 
University of California, Berkeley 4, California. 

Yasuoka, Yoshinori, M.Sc. (Tokyo Univ.), Lecturer of Mathematics, Mathematical In- 
stitute, Faculty of Science, Tokyo Metropolitan University, 591 Fusuma-cho, Meguro- 


; 


ku, Tokyo, Japan, 2-242 Okuzawa-cho, Setagaya-ku, Tokyo, Japan. 





REPORT OF THE TREASURER FOR 1954 


The report of the treasurer has been delayed this year on the recommendation 
of the treasurer and the concurrence of the Executive Committee in order that 
the report of the auditor might be submitted to the membership along with that 
of the treasurer. Because of the timing of publication of the Annals, it has been 
possible to give financial statements which more accurately reflect the position 
of the Institute on December 31, 1954 without introducing any additional delay 
in the publication of the reports. In previous years the books of record of the 
Institute have been closed in time to prepare financial reports for presentation 
at the Annual Meeting. While Statements of Income and Expense accurately 
reflected the activities through the whole of the calendar year, the Statements 
of Financial Condition did not include the cash receipts and the associated 
liabilities to members and subscribers which accumulated during the latter part 
of December. The Statement of Income and Expense submitted herewith can 
properly be compared with those for the last few years. The Statement of Finan- 
cial Condition will differ in the three accounts mentioned in part because of the 
change in time of preparation. 

Income and expenses are both higher in 1954 than in 1953. The increase in 
surplus is much lower in 1954 than in the two preceding years. Expenses increased 
for several reasons. One major increase was due primarily to the increase from 
708 numbered pages in the Annals in 1953 to 826 pages in 1954. Along with an 
increase in the number of copies distributed, this item accounts for a $2,141.94 
increase in expenses. Printing costs are rising and the cost per page will increase 
markedly in 1955. Another major increase in expenses resulted from the decision 
of the Council that at least twice a year the Executive Committee should meet 
in person and that, to assure these meetings, the Institute should pay travel 
expenses of members of the Executive Committee to the Summer Meeting and 
to the Annual Meeting each year. This item amounted to $990.98 in 1954. Other 
factors in the increase in expenses were the larger volume of material mailed to 
members by the Secretary and the use of Air Mail in distributing ballots on two 
occasions during the year to members outside the United States and Canada. 

The decrease in dues rate for members outside the United States and Canada 
which became effective in 1954 would have produced a decrease of $410 in in- 
come if no new members had been accepted. Increases in membership and re- 
instatements reduced the decrease to $186. An increase of $354 from individual 
members in the United States and Canada (in spite of a reduction in student dues 
rate), $300 from institutional members and $828.35 from nonmember subscrip- 
tions to the Annals gave a net increase of $1,296.35 from dues and subscriptions. 
Changes in the form of investments and increased investments yielded $1,350.18, 
an increase of $1,025.18 over 1953. 

The financial reports follow. 
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THE INSTITUTE OF MATHEMATICAL STATISTICS 


Statement of Financial Condition 
December 31, 1954 
ASSETS 
Current Assets 
Cash in Bank aT, 
Savings Bank Account................. 
Investments (at cost)*... 
Dues Receivable 
Due on Back Issue Sales 
Miscellaneous Accounts Receivable 
Inventory of Back Issues. . . 


Total Assets 
LIABILITIES AND MEMBERS EQUITY 


Current Liabilities 


Account Payable, Printing of Dec. Annals (Estimate) 
Withholding and F.I.C.A. taxes payable. . 

Accrued Expenses aa 
Amount Held for Biometrika subscriptions... . 
Amount Held for Wald Memorial volume. . 


Total Current Liabilities 


Liabilities to Members and Subscribers 


Advances on Dues, 1955, 1956... .... 
Advance on Dues, life members 
Advances on Subscriptions 


Total Liabilities to Members and Subscribers 


Members Equity 
Reserve for Maintaining Supply of Back Issues... 
Surplus. . 


Total Members Equity 


Total Liabilities and Members Equity 
Statement of Income and Expenses 
for the period January 1, 1954 through December 31, 1954 
Revenues 


Membership Dues 
Subscriptions. . 


$14,115.00 
10,071.00 


.. $10,689.37 


2,058.98 
43,771.51 
73.00 
1,107.20 
104.65 
18,524.28 


$76 , 328.99 


14,430.86 


56,115.06 


76,328.99 


* The book value of investments is cost. The cash value is $6 higher. During 1954, $20,000 
was transferred from Time Certificates of Deposit to Savings and Loan Associations. In- 
vestments at the end of 1954 were $11,888 in U. S. government bonds and $31,883.52 in 
Savings and Loan Associations. These amounts are fully insured by Agencies of the U. S. 
government, 
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Sale of Back Issues: 
Gross Income $5,498.83 
Cost of Sales. 1,884.04 3,614.79 


Income frora Investments 1,350.18 
Miscellaneous Revenue........... oe Py 534.33 


Total Revenue 5 ete iets 29,685.30 


Expenses 


Printing of Current Annals: 
Cost of current issues printed $13,927.98 
Inventory, December 31, 1954 of 1954 Annals..... 1,448.13 12,479.85 


Editorial Expense 849.69 
Miscellaneous Printing, Stationery | Postage. . 2,873.40 
Miscellaneous Office Expense. . ois 755.60 


Salary. ...:.. ogi sw 3,360.00 
Contributions.... Le ne eines 151.73 


President’s Office Expenses _ eae 131.00 


Travel om Se LSS as 990.98 
Wa Ms Mike Ag WR on 55a 0 609-0 


Total Expenses . 
Excess of Revenues over Expenses i 
Minus Addition to Reserve for Maintaining Supply of Back Issues 


Increase in Surplus 
Surplus, December 31, 1953 


Surplus, December 31, 1954............... ... $34,362.54 


K, J. ARNOLD 
Treasurer 


a 


I have examined the financial statements of the Institute of Mathematical 
Statistics as of December 31, 1954 consisting of a Statement of Financial Condi- 
tion as of this date and a Statement of Revenue and Expense for the year then 
ended. 

My examination was made in accordance with generally accepted auditing 
standards and included such tests of the accounting records as were considered 
necessary. The inventory of back issues was not physically examined, but direct 
confirmation was received from the independent agent who is responsible for the 
physical control of such inventory. Accounts receivable were not confirmed; 
however, they were not considered to be of a material amount. 

In my opinion, the accompanying Statement of Financial Position and the 
Revenue and Expense statement present fairly the financial position of the In- 
stitute of Mathematical Statistics as of December 31, 1954 and the results of 
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its operations for the year then ended, in conformity with generally accepted 


accounting principles applied on a basis consistent with that of the preceding 
year. 


CHARLES LAWRENCE 


CPA (Michigan) 


PUBLICATIONS RECEIVED 


Jones, B. W., The Theory of Numbers, Rinehart and Co., Inc., New York, 1955, xi, 143 pp., 
75. 

Selected Papers in Statistics and Probability by Abraham Wald, McGraw-Hill Book Co., Inc., 
New York, 1955, ix + 702 pp., $8.00. 

TueiL, H., Linear Aggregation of Economic Relations, North-Holland Publishing Co., 
Amsterdam, 1954, xi + 205 pp., $5.00. 

Tinbergen, J., Centralization and Decentralization in Economic Policy, North-Holland 
Publishing Co., Amsterdam, 1954, viii + 80 pp., $1.80. 





TRABAJOS DE ESTADISTICA 


Review published by ‘Instituto de Investigaciones Estadisticas’’ of the ‘‘Consejo 
Superior de Investagaciones Cientificas.’’ Madrid, Spain. 


Vol. V CONTENTS Cuad, II 


J. NEYMAN......Sur une famille de tests asymptotiques des hypotheses statistique 


; composées- 
P. Zoroa eens de variables eotterae y sus aplicaciones. 


i Wns. cae é _........A Problem of Encounters. 
NOTAS 


Cuar.es A. Spoert .La ciencia actuarial: Una visién general de su desarrollo 


tedrico. 

J. Royo y 8. Ferrer......Tabla de nimeros aleatorios obtenida de los nimeros 
de la Loteria Nacional Espafiola. 

E. Luacunez y J, L. TenpErRo......Notas para la ensefianza de algunos conceptos 
elementales de Estadistica en el Bachillerato. 


Crénicas Bibliografia Cuestiones 


For everything in connection with works, exchanges and subscriptions write to Prof. 
Sixto Rios. Departamento de Estadistica del Consejo Superior de Investigaciones 
Cientificas, Serrano 123, Madrid, Spain. The Review is composed of three fascicles 
published quarterly (about 350 pages) and its price is 80 pts. for Spain and South- 
America and 3 American Dollars for all other countries. 


THE AMERICAN STATISTICAL ASSOCIATION 


announces the publication of two new monographs: 


Statistical Problems of the Kinsey Report 


by CocHRAN, MOSTELLER and TuxEy. The evaluation of the statistical methodology used by 
Kinsey and his associates in their first volume. This study was requested by the Committee 
for Research in the Problems of Sex of the National Research Council, which is sponsoring 
Dr. Kinsey’s work. The contents include Statistical Problems of the Kinsey Report, Dis- 
cussion of comments by Selected Technical Reviewers, Comparison with Other Studies, Pro- 
posed Further Work, Probability Sampling Considerations, The Interview and The Office, 
Desirable Accuracy, Principles of Sampling. 

The monograph contains 331 pages, plus a foreword, preface and index; bound in blue 
buckram; $3.00 to ASA members; $5.00 to others. 


Proceedings of the Business and Economics Statistics Section. 


The papers given at the sessions sponsored by the Business and Economics Statistics Section 
at the Annual Meeting of the American Statistical Association in Montreal in September 
1954. This volume contains papers on Pension Funds, Business Outlook, International Pay- 
ments, Consumer Survey Data, Forecasting, Employment and Unemployment Statistics, 
Stock Market, Government Statistics, Measurement of Saving and Investment, Mobilization, 
Productivity. 
Approximately 250 pages, paper bound. Price $2.00 to ASA members; $3.00 to others. 

Copies may be ordered directly from the AMERICAN STATISTICAL ASSOCIATION, 1108 Sixteenth 
Street, N. W., Washington 6, D. C. 





ECONOMETRICA 


Journal of the Econometric Society 
Contents of Vol. 23, No. 2 - April, 1955 


NS GMAOBS. ods oc csncecessvcatbadikecadenase The Life Cycle in Income, Saving, and Asset Ownership 


- D. SarGan: ... The Period of Production 
. J. GARTAGANIS anp A. 8S. GoLDBERGER: 


A Note on the Statistical Discrepancy in the National Accounts 
Georce B. Danrzie: 


eee Bounds, y eey Cae and Block Triangularity in Linear Programming 
Ns... Recent Experiences with the Munich Business Test 
Hersert A. Smron: Causality and Econometrics: Comment 


Herman O. A. Woxp. on Causality and Econometrics: Reply 
REPoRT ON THE UPPsaLA MEETING 
Boox Reviews: 


The Measurement o Cepeumere Repeaters and Behavior in the United Kingdom, 1920-1988 (Richard Stone). 
Review by Arnold 


Premiers éléments @’ — a ité nationale de la B , 1948-1951. Review by J. Marczewski. 
4 Study in the Analysis of Time Series (H. td. Review by M. 8. Bartlett 
he Dalia: : The History and Peution of the Dollar in World Affairs (Roy F. Harrod). Review by K. K. Kuri- 


pee. for Decision (Irwin D. J. Bross). Review by Stanley Isaacson. 

Gleichgewichtskalkulation, Untersuchungen zur Mazimalgewinnberechnung des Betriebes (Adolph Angermann) 
Review by Hans Peter. 

Statistical Inference (Helen M. Walker and Joseph Lev). Review by Acheson J. Duncan. 

Source Readings in Economic Thought ae C. Newman). Review by Louise Sommer. 


ees internationauz du centre national de la recherche scientifique, XL. Econometrie. Review by W. M. 
r 


Methods of Crop Forecasting (Fred H. Sanderson). Review by Karl A. Fox. 


Published Quarterly Subscription rates available on request 
The Econometric Society is an international society for the advancement of economic theory in its 
relation to statistics and mathematics 


Subscriptions to Econometrica and inquiries about the work of the Society and the procedure in applying 
for membership should be addressed to Richard Ruggles, Secretary, The Econometric Society, Box 
1264, Yale University, New Haven, Connecticut. 


BIOMETRIKA 


Volume 42 Contents Parts 1 and 2, June 1955 


DAVID, F. N. Studies in the history of probability and statistics. 1. Dicing and gaming (A note on the 
history of probability). POWELL, E. O. Some features of the generation times of individual bacteria. 
BROADBENT, 8. R. Quantum hypotheses. SAMPFORD, M. R. The truncated negative binomial 
distribution. WILK,M.B. The randomization analysis of a generalized randomized block design. PAGE, 
E. 8. Control charts with warning lines. COX, D. R.and STUART, A. Some quick sign tests for trend 
in location and dispersion. ANIS, A.A. The variance of the maximum of partial sums of a finite number 
of independent normal variates. THOMPSON, H. R. Spatial point processes, with applications to ecol- 
ogy. WHITTLE, P. The outcome of a stochastic epidemic—A Note on Mr. Bailey’s paper. FOSTER, 
F. G. A Note on Bailey’s and Whittle’s treatment of a general stochastic epidemic. RUSHTON, 8. and 
MAUTNER, A. J. The deterministic model of a simple epidemic for more than one community. HAN- 
NAN, E. J. Exact tests for serial correlation. BARTLETT, M. 8. and MEDHI, J. On the efficiency of 
procedures for smoothing periodograms from time series with continuous spectra. WISE, J. The auto- 
correlation function and the spectral density function. JOWETT, G. H. Sampling properties of local 
statistics in stationary stochastic series. HEINE, V. Models for two-dimensional stationary stochastic 
processes. GANI, J. Some problems in the theory of provisioning and of dams. BARTLETT, M. 8. 
Approximate confidence intervals. III. A bias correction. TATE, R.F. The theory of correlation between 
two continuous variables when one is dichotomized. EDITORIAL. The normal probability function: 
Tables of certain area-ordinate ratios and of their reciprocals. DAVID, F. N. and KENDALL, M. G. 
Tables of symmetric functions. Part V. Miscellanea: Contributions by DUNNETT, C. W. and SOBEL, 
M., HODGES, J. L., CHU, J. T., HALDANE, J. B. 8., THOMSON, G. W., GULLAND, J. A. Reviews 


The subscription price, payable in advance, is 45s. inland, 54s. export (per volume including postage). Cheques 
should be drawn to Biometrika and sent to “The Secretary, Biometrika Office, Department of Statistics, 
University College, London, W.C. 1.” All foreign cheques must be in sterling and drawn on a bank 
having a London agency. 





MATHEMATICAL REVIEWS 


A journal containing reviews of the mathematical liler- 
ature of the world, with full subject and author indices 


Publication of this journal is sponsored by the American Mathe- 
matical Society, Mathematical Association of America, Institute of 
Mathematical Statistics, London Mathematical Society, Edinburgh 
Mathematical Society, Union Matematica Argentina, and others 


Subscriptions accepted to cover the calendar year only. 
Issues appear monthly except July. $20.00 per year. 


Send subscription order or request for sample copy to 


AMERICAN MATHEMATICAL SOCIETY 
80 Waterman Street, Providence 6, Rhode Island 


JOURNAL OF THE 


ROYAL STATISTICAL SOCIETY 


Series B (Methodological) 


Vol. XVI, No. 1, 1954 
F. G. Foster AND A. STUART Distribution-free Tests in Time-Series Based on the 
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